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Abstract. In this paper we introduced and developed the theory of Modified Interior Distance 
Functions(MIDF s). 

The MIDF is a Classical Lagrangian (CL) for a constrained optimization problem, which is 
equivalent to the initial one and can be obtained from the latter by monotone transformation both the 
objective function and constraints. 

In contrast to the Interior Distance Functions (IDFs), which played a fundamental role in 
Interior Point Methods (IPMs), the MIDFs are defined on an extended feasible set and along with 
center, have two extra tools, which control the computational process: the barrier parameter and the 
vector of Lagrange multipliers. 

The extra tools allow to attach to the MIDFs very important properties of Augmented 
T .a g rangeans One can consider the MIDFs as Interior Augmented Lagrangeans. It makes MIDFs 
similar in spirit to Modified Barrier Functions (MBFs), although there is a fundamental difference 
between them both in theory and methods. 

Based on MIDFs theory. Modified Center Methods (MCMs) have been developed and 
analyzed. 

The MCMs find an unconstrained minimizer in primal space and update the Lagrange 
multipliers, while both the center and the barrier parameter can be fixed or updated at each step. 

The MCMs convergence was investigated, and their rate of convergence was estimated. The 
extension of the feasible set and the special role of the Lagrange multipliers allow to develop MCMs, 
which produce, in case of nondegenerate constrained optimization, a primal and dual sequences that 
converge to the p rimal -dual solutions with linear rate, even when both the center and the barrier 
piinfflf tf f are fixed. Moreover, every Lagrange multipliers update shrinks the distance to the primal 
dual solution by a factor 0 < y < 1 which can be made as small as one wants by choosing a fixed 
interior point as a "center" and a fixed but large enough barrier parameter. 

The numerical realization of MCM leads to the Newton MCM (NMCM). The approximation 
for the primal minimizer one finds by Newton Method followed by the Lagrange multipliers update. 

Due to die MCM convergence, when both the center and the barrier parameter are fixed, the 
condition of the MIDF Hessians and the neighborhood of the primal minimizer where Newton method 
is "well” defined remains stable. 

It contributes to both the complexity and the numerical stability of the NMCM. 
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1. Introduction. 

In the mid 60s, P. Huard [ BuiH66], [Huar67a] and [Huar67b] introduced Interior Distance 
Functions (IDFs) and developed Interior Center Methods (ICMs) for solving constrained optimization 
problems. Later these functions, as well as Interior Center Methods, were intensively studied by A. 
Fiacco and G. McCormick [FiacM68], K. Grossman and A. Kaplan [GrosK81], R. Mifflin [MifF76], 
and E. Polak [PolE71], just to mention a few. 

It was found [PolE71] that there are close connections not only between the IDFs and the 
Barrier Functions [FiacM68], but also between ICMs and methods of feasible directions [Zout60], 
[ZPP63], 

The ICMs consist of finding at each step a central (in a sense) point of the Relaxation Feasible 
Set (RFS) and updating it in accordance with the objective function level that has already been 
attained. The RFS is the intersection of the feasible set with the Relaxation (level) set of the objective 
function at the attained level. In the Classical ICM the "center" is sought as a minimum of the IDF. 

Interest in the IDFs, as well as in the Barrier Functions (BFs), grew dramatically in connection 
with the well known developments in mathematical programming during the last ten years since N. 
Karmarkar published his projective scaling method [Kar84], In fact, his potential function is an IDF 
and his method is a Center Method, which produces centers of spheres, which belong to the feasible 
polytop. The concept of centers has a long and interesting history. 

In the 60s, concurrently with P. Huard's ICM, the Gravity Center Method was independently 
developed by A. Levin [Lev65] and D. Newman [New65], the Affine Scaling (ellipsoid centers) 
Method by I. Dikin [Dik67], and the Chebyshev Center Method by S. Zuchovitsky, R. Polyak and 
M Pimak [ZPP69]. The Affine Scaling Method, which one can view as a method of feasible direction 
with special direction normalization, was rediscovered in 1986 independently by E. Barnes [Bam86] 
and R. Vanderbei, M Maketon and B. Freedman [VanMF86] as a simplified version of Karmarkar's 
method. 

In the 70s, N. Shor [Sh70] and independently D. Yudin and A. Nemirovsky [YuN76] 
developed the ellipsoid method, which generates centers of ellipsoids with minimal volume 
circumscribed around some convex sets. Using this method, L. Khachiyan [Kh79] was the first to 
prove in 1979 the polynomial complexity of the Linear Programming problem. His result had a great 
impact on the complexity theory, but numerically the ellipsoid method appeared to be not efficient. 
It is interesting to note that the rate of convergence, which was established by I. Dikin [Dik74] for 
the Affine Scaling Method, in case of nondegenerate linear programming problems, is asymptotically 
much better than the rate of convergence of the ellipsoid method and numerically, as it turned out, 
the Affine Scaling Method is much more efficient [AdRVK89]. 

The concept of centers became extremely popular in the 80s. Centering and reducing the cost 
are two basic ideas that are behind the developments in the Interior Point Methods (IPMs) for the last 
ten years. Centering means to stay away from the boundary. A successful answer to the main 
question: how far from the boundary one should stay, was given by Sonnevend [Son85] (see also 
[JarSS88]) through the definition of the analytic center of a polytop. The analytic center is a unique 
minimizer of the Interior Distance Function. The central path - curve, which is formed by the analytic 
centers, plays a very important role in the IPM developments. It was brilliantly shown in the paper 
by C. Gonzaga [Gon92], 

Following the central path J. Renegar [Ren88] obtained the first path-following algorithm 
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with O(VnL) number of iterations against 0(nL) of the N. Karmarkar's method. 

Soon afterwards, C. Gonzaga [Gon88] and P. Vaidya [Vaid87] described algorithms based 
on the centering ideas with overall complexity O(r^L) arithmetic operations, which is the best known 
result so far. 

In the course of the 30 years history of center methods it became clear that both the theore- 
tical importance and the practical efficiency of the center type methods depends very much on the 
"quality" of the center and on the cost to compute the center or its approximation. 

The center was and still is the main tool to control the computational process in a wide variety 
of center methods in general and in IPMs in particular. 

However, still there is a fundamental question, which has to be answered: how consistent the 
main idea of center methods - to stay away from the boundary with the main purpose of 
constrained optimization - to find a solution on the boundary. 

In this paper we will try to address this issue. The purpose of this paper is to introduce the 
Modified Interior Distance Functions (MIDFs) and to develop their theory. Based on this theory, we 
are going to develop the Modified Center Methods (MCMs), to investigate their convergence and 
to establish their rate of convergence. 

The MIDFs are particular realizations of the Nonlinear Rescaling Principle [P0I86], which 
consists of transforming a constrained optimization problem into an equivalent one and using the 
Classical Lagrangean for the equivalent problem for both theoretical analysis and numerical methods. 
In the case of MIDFs, we transform both the objective functions and the constraints by monotone 
transformations. The constraints transformation is parametrized by a positive parameter. The MIDF, 
which is a Classical Lagrangean for the equivalent problem, has properties that make it substantially 
different from both IDF as well as Classical Lagrangean for the initial problem. 

Instead of one tool (the centers), which controls the process in the IDF, the MIDF has three 
tools: the center, the barrier parameter and the vector of Lagrange multipliers. Two extra tools 
provide the MIDF with very important properties. 

The barrier parameter not only allows to retain the convexity of the MIDFs when both the 
objective function and the constraints are convex, it also allows to "convexity" the MIDFs in the case 
when the objective function and/or the constraints are not convex but the second order optimality 
conditions are satisfied. The barrier parameter is also crucial for the rate of convergence of the 
MCMs. 

The other critical extra tool is the vector of Lagrange multipliers. It allows to attach to the 
MIDFs nice local properties of Augmented Lagrangeans [Ber82], [GolT89], [Hes69], [Man75], 
[Pow69], [Rock74] and provides them with new important features. 

One can consider MIDFs as Interior Augmented Lagrangeans. However, in addition to the 
nice local Augmented Lagrangean properties, the MIDFs possess important global properties, which 
manifests itself when the Lagrange multipliers are fixed and one changes the barrier parameter and/or 
the center to approach the solution. 

What is most important, the MIDFs are defined and keep smoothness of the order of the initial 
functions on the extension of the feasible set. 

The special MIDFs properties allows to develop MCMs, which produce the primal-dual 
sequences that converge to the primal-dual solution, even when both the center and the bamer 
parameter are fixed. Moreover, under nondegeneracy assumptions the primal and dual sequences 
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converge to the primal-dual solution with linear rate. 

So the main driver in MCMs is the vector of Lagrange multipliers rather than the center or 
the barrier parameter. It allows not only to stay as close to the boundary as one wants, it makes 
possible for the primal minimizer to be even outside of the feasible set. At the same time, in constrast 
to the IDF, the solution of the constrained optimization problem is always inside of the level set of 


the MIDF. 

So at the final stage, the MCMs are closer to the multipliers methods rather than to IPM. 
As far as the initial stage is concerned, when the vector of Lagrange multipliers is fixed, then one can 
take advantage of the global-self concordance [NesN94] properties of the DDF, which guarantee a 
very re asonab le convergence by following the central path, which one obtains by changing the bamer 
parameter or the center. 

The numerical realization of the MCMs leads to the Newton MCM. In contrast to the IDF, 
the MtDF Hessian not only exists on an extended feasible set but due to the MCMs convergence, 
when both the center and the bamer parameter are fixed, the condition number of the MIDF Hessian 
is stable and so the neighborhoods of the primal minimizers, where the Newton method is "well" 


defined [Sm86]. 

It contributes to substantial reduction of the number of Newton steps per Lagrange multipliers 
update from step to step. Every update shrinks the distance to the primal-dual solution by a fixed 
factor 0 < y < 1, which depends on the input data and the size of the problem. It can be made as small 
as one wants even when both the "center" and the barrier parameter are fixed, but the parameter is 
large enough. 

The paper is organized as follows. After the statement of the problem, we discuss the IDFs 
properties and introduce the MIDFs. Then we establish the basic MIDFs properties at the primal- 
dual solution and compare them with the correspondent IDFs properties. Then we prove the basic 
theorem, which is the foundation for the MCMs and their convergence. We describe the MCMs and 


analyze their convergence. 

The MIDFs have some common features with Modified Bamer Functions [Pol92], but there 
are fundamental differences between them as well. We illustrate the differences using a few small 
examples. 

Then we describe the Newton MCM, which is a numerical realization of the MCM. 

We conclude the paper by considering dual problems, that are based on MIDF, They have 
some distinctive features, which we will discuss briefly. 
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1. Problem Formulation and Basic Assumptions. Let f 0 (x) and -/(*), t - 1 , .jn be convex, 

C 2 -function in R* and there exists 

x* * argmin {^(x)/xeQ} (1.1) 

where Q = { x:/(x ) a 0, / = l, 

We will assume that Slater condition holds, i.e. 

3 x°:/(x°)>0 ,/* (12) 

So the Karush-Kuhn-Tucker's (K-K-Ts) optimality conditions hold true, i.e. there exists a vector 
u * * a 0T such that 

i' (x*«*) = />*)- £«;/ / l (*‘)=o-,y;(x>;=o, (i.3> 

* /-I 

M 

where L(x,u) * f 0 (x) - Elf,/ (x) is the Lagrange function for (1.1) and 

M 

f f t (x) * gra d/(x), i * Q,...jn, are row-vectors. Let /* * it:f t (x *) » 0 } » { l,.../) is the active 
constraints set and r ift . 

We consider the vector-function /(x) * (/JO*) »—/,(*)) >^ e vector-function of active 
constraints / r) (x) * (/J(x),.../ f (x» and the vector-function of passive constraints/ ^(x) = 

We also consider their Jacobians/^x) m J (f(x)) t f ^ (*) *^(/( r )( x ))» 

diagonal matrices 1/ ■ [diagtr, ]"i , t/ f * [diag u, ]^ ml with entnes / = 1, . .. m and Hessians 




V, 


ax t ax t 


t » 


1,-V* 

1 


i -0,1, -JR, 
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of the objective function and constraints. The sufficient regularity condition 

rank «/>0, tel' (14) 

together with the sufficient condition for the minimum x‘ to be isolated 

(i"„(* , .ti , )».i) * *(*.*). *>« V»*0 :/*(,)(* ■)** O' (15) 

comprise the standard second order optimality condition, which we will assume in this paper. 

We shall use the following assertion, which is a slight modification of the Debreu theorem (see [Pol 
92]). 

Assertion 1 Let A be a symmetric n * n matrix, B be an r * n matrix, and 

U * [diag tJ,]'.! :R r ~R r , where u = (u, u r ) > O' and let 

(A y, y) a A. (y,y), k>0,Vy:By a ‘O r 
Then there exists k 0 > 0 such that for any 0 < p < X the following inequality 

((4 + kB T UB)x, jr)in(jr,x), VxeR* 
holds true whenever k 2 k^ 

2. Interior Distance Function. Let^ e int Q and a * f 0 (y), we consider the Relaxation Feasible Set 
(RFS) on the level a : Q(a) ■ Qfl{x:^(x) s a} and an interval T*ix : a<t<a* =4(x*)) 
The Classical IDFs F(x,a) and H(x, a) : Q(«)xf-4 R 1 are defined by formulas 

F(x, a ) - -m ln(a -f Q (x)) - £ In ffic) ; H(x, a ) - m (a ~f Q (x)Y x + Y,ft (*) 

Let us assume that In t m and r 1 = 00 for t & 0, the Classical Interior Center Methods 
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(ICMs) consists of finding the "center" of the RFS by solving the following unconstrained 
optimization problem 

£ = x(a ) = argmin { F(x, a ) /x e E" } 

and updating the objective function level a , i.e., replacing a by a =f 0 (x) Due to 
x->dQ(a) -*F(x,a)-« the new center $(a) e int Q(a) c Q for any a e T 
Moreover, if the IDF possess the self-concordance properties (see [NesN 94]) the central trajectory 
{ x( a ), a € T) has some very special features (see [Ren 88] and [Gon 92]). 

Starting at a point close to the central trajectory - "warm" start - for a particular a e T and 
using Newton step for solving the system 

F x (x,a)-0" 

in x following by a "careful" a update, one can guarantee that the new approximation will be again 
a "warm" start and the gap between the current level a = f 0 (x) and the optimal level a -fix’) will 
be reduced by a factor 0 < q, < 1, which is dependent only on the size of the problem. 

However along with these nice properties the IDFs have their well known drawbacks. Neither 
theIDFsF(x,a) and// (x, a) nor their derivatives exist at the solution. Both F(x ,a) and H (x ,a) 
grow infinitely when x(<x ) approaches the solution. 

All constraints contribute equally to IDFs and one can obtain the optimal Lagrange 
multipliers only in the limit when £(<x )— » x * . What is particularly important for nonlinear 
constrained optimization is the feet that the condition number of the IDF Hessians vanishes when the 
process approaches the solution. Let's consider this issue briefly, using F (x, a). Keeping in mind the 
boundness of the RFS Q (a) one can guarantee that the unconstrained minimizer $ e int Q ( a ) 
exists and 
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m ft (Q\ 

'•'.A-)-— ^sr/.C*)-S TT3T ■ O’ 


( 2 . 1 ) 


-/»(*> 


M /<*) 


or 


* ($) - v g ’A^L f> ($) = o" 
/oC } « »/(*> ' 


( 2 . 2 ) 


We define 


a, = Q t (u ) » ( « -/o ($)) (my; ($)r‘ , i - 


(2.3) 


and consider the vector of Lagrange multipliers 0"0(a) * (O^a) ,/* l,...^n), then (2.2) can be 
rewritten as follows: 

L (i. fi) */' 0 (S) - £ fi, /, (S) */» (S) - 0/ (®) - O’ 


/-l 


Also uf. ($) * (a -f Q (£)) m " l , / » 1 So £ (£) * a -f 0 (0). 

i-i 

Under the uniqueness assumptions (1.4) - (1.5) we have 

lim 0(a) = «* , lint *(«) = x * 

a -♦ a* a -» a* 

Let's consider the Hessian F n Jx, a ). We obtain 




V A ) 

« -/.(*) 

> 

\ mf t $) ; 

J 


/,(«> 


Further, for any 


1,.„, /n , we have 


« -/ 0 (*)' 

/ 

- Iff ^ 

-AW (« -/oW) 

k my;w ; 

— /FT 
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Therefore 


E 

/-I 


X 

x"-\ 

•a 

>• 

a 

,Y 

[l */fi) } 





— U oW E Z-fUrffi) * £ 7ST )Tf ® 

a -./£($) <-i m f t (*) <-i W -/<W /<(*) 


iar (/WE a,/<*) * E ^ (/WA(*) 


“ -/o(*) <•* 


LetD(x) - [diag /(x)]" ( , U(«) - [diag «,(« )]*,.*«»«» the H essuui F"(J,«) 
we obtain 

F" (S,a)-L"ji,S) »— 775: IfiWE “,(«)/',(*) 

** * 7 »W <■! 

In view of i *$(a) - x* , weobtain 

l » (S,C) - 1 "„(x -.«•). tr 0 («)f E *>,/.(*> -//(* •)/.(* ’). 

M 
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Therefore 


F" <*,«) .L"Jf > •) * vy* •)/£<«>/> •) ♦ (« -/„(i))'7'„ r (^ -)A(* •) (2 3) 


where £(a) 3 [ diag e ( (a)£.j and 

lim e t (a) = lim u,(ct) f,' X ($(<*)) a (2.4) 

a -» a* a -► a* 

The mineigval « ) is defined by the first two terms (2.3), therefore in view of (2.4) 

due to the Assertion 1 with A = L"„ (x, «*) and B (**) there exists ^ > 0 : 

mineigval F" st {x t (i) = p 

At the same time due to (2.4) we have maxeigval - • when a- a . Therefore the 

condition number of the Interior Distance Functions Hessians vanishes when £ approaches x The 
consequences of the ill -conditioning is much more substantial in nonlinear optimization than in Linear 
Programming. In case of LP the term L \ (* n) in the expression for the IDF Hessian disappears and 
by rescaling one can practically eliminate the ill-conditioning effect, at least, when the problem is not 
degenerate. 

In nonlinear optimization the situation is completely different and the ill conditioning was and 
still is an impor tant issue both in theory and practice. To eliminate the ill conditioning of the IDF we 
will introduce the Modified Interior Distance Functions. 


3. Modified Interior Distance Functions We consider a vector y € int 0 and A (y, x ) 
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/o (>0 -/o ( x ) > 0. then the Relaxation Feasible Set (RFS): 

Q(y) a 0,/* l,...,/n ; A(y,x)>0} 

The problem (1.1) is equivalent to 

x* = argmin{^(x)/xeQ(»} (3.1) 

It is easy to see that for any k > 0 

Q(y)*{x:ib- l [ln(ib);(jr) + ACF.x))-bACv,x)]aO ACv,x)>0) 

Therefore the problem (3.1) is equivalent to the following problem: 

jr * = argmin { -In A(y,x)/x e Q(y)} (3.2) 

Assuming In / * -» for / s 0 we define the MIDF F(x,y,u,k ): 

R" * intQ * R/ x R ++ ‘ - R 1 as a Classical Lagrangeah for the equivalent problem (3.2): 

F(x,y,u,k) - ( -1 +4T 1 Eu t ) In A (y,x) -k' x Eu t In (kf t (x) ♦ A ( y,x )) (3 3) 

<*1 M 

The MIDF F {x, y, u, k) corresponds to the IDF F(x, a ). To define the MIDF, which corresponds 
to H ( x, a), we first note that for any k > 0 
Q 00 3 { * : * ' l [(*/(*) + A (y.x))' 1 - A _1 (y,x) ] s 0 , A (y ,*)><)} 

Therefore the problem (1.1) is equivalent to 

jt* ■ argmin { A‘ l (y,x)/x€ Q(y)) (3.4) 

Assuming r l » — for / i 0 we define the MIDF H(x t y,u,k) : R" * intQ * R + " x R ^ R 1 
as a Classical Lagrangean for the equivalent problem (3.4): 

H (x,y,u,k) *(-l ♦ Jt' 1 Eu t ) A ’ l Cv,x) + k~ l Eu t (kf t (x) * AOvx))' 1 (3.5) 

i*t 1*1 
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The MEDF (3.5) corresponds to the P. Huard's IDF H (x , a). Both F(x, y, u, k) and H (x, y, u. k ) 
are Classical Lagrangeans for problems equivalent to (1.1), which we obtained by monotone 
transformation both the objective function and the constraints. 

Finally, the MIDF Q ( x,y,u,k ) : R" x intQ * R„" * R t+ l - R 1 , which is defined by formula 

• n + A O.jc 

/-I 

corresponds to the potential function 

e(*,«) -(«-/«(*))■• n /■’(*). 

fl 

So, we have F( x, y, u, k) - In Q ( x; y, u, k ) and all basic facts about F(x, y,u,k) remain true for 
Q(x,y,u,k ), therefore we will not consider the MIDF Q ( x, y, u, k ) further in this paper. 

There is a fundamental difference between the Classical and Modified Interior Distance 
Functions. First we are going to show the difference at the local level - in the neighborhood of the 
primal-dual solution. In the next section, we will consider the local MIDFs properties. 

4. Local MIDFs Properties In contrast to the IDFs, the MIDFs are defined at the solution, they 
do not grow infinitely when the primal approximation approaches the solution and under the fixed 
optimal Lagrange mul tipliers, one can obtain the primal solution by solving one smooth unconstrained 
optimization problem. 

Proposition 1 . For any k > 0 and any y e int D , the following relations are taking place. 

PI. F(x*,y,u* t k) ■ -lnA(y,x*) i.e. f 0 (x*) •f 0 (y) -exp (-F(x 
and 

H(x \y,u‘,k) = A' 1 (y,x*) i.e. / 0 (**) >,«%*) 

The property PI follows immediately from the definition of MIDFs and the complementary 
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conditions for the K-K-Ts pair (x'.u ): 

«//(**) =0, i = \,...jn. 

The fact that the MIDFs value at (x,u) coincides with the optimal objective function value 
for the equivalent problem independently on both the center y € int Q and barrier parameter k > 0 
indicates that one can approach the solution by means other than those, which have been traditionally 
used in the IPM developments. 

Proposition 2 . For any k > 0 and any y € int Q , the following relations are taking place. 

P2. F' x {x\y t u',k) = A' l Cv^*)I / x (Jt*.M*) s '° 

and 

H' x (x *,y,u \k) - A ' 2 (y,x * ) L ' X (x \i i *) * 0 
The proposition 2 immediately follows from the definition of MIDFs and K-K-Ts conditions. 
If k> E 1 1 * the unconstrained minimizer of F(x, y, u, k ) or H{ x, y, u, k ) in x is a solution of the 
convex programming problem (1.1), i.c., the following property is taking place. 

Proposition 3 

P3. x * = argmin ( F(x,y,u * Jt) Ixf. I" } * argmin { H(x,y,u *^) /xeV) 

In other words, the knowledge of the optimal Lagrange multipliers allows us to solve the problem 
(1.1) by solving one unconstrained optimization problem. Therefore if F (x, y, u, k) is strongly 
convex in x and we know a good approximation u for the vector u , then i = x(y t u,k) — 
argmin { F(x,y,u,k) /xe ** } is a good approximation for x* while both the "center" y e int Q and 

k > £ u, are fixed. 

If by using x we can improve the approximation u , then it is possible to develop a method 
where the convergence is due to the Lagrange multipliers update rather than due to the center or the 
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barrier parameter update. 

Our goal is to develop such a method, but first we will try to understand under what 
conditions the MIDFs F(x,y,u,k)andH(x,y,u,k)vri31be strongly convex in x when both y and 
k > 0 are fixed. 

The following proposition is the first step in this direction. 

Pmnosition 4 . If / (x ) e C 2 , i = 0, 1,..., m , then for any fixed y € int Q , k > 0 and any 
KKTs pairs (x*, u ) the following is true: 

P4 F"J.x ‘ji.u’.k) - A' 1 "„(*■.«■) * 

A •' O',* • )(*0J,)(* ')f Vr'f'd* •) - (/'„,(* »< W* '»! 

and 

a •* o-.* • )(*(/'(„(* •)/ y;/ (r) (* •) - (/'m(* ')/“« '») 

The proof is given in the Appendix A1 . We are now ready to prove the first basic statement. 
Theorem 1 . If / (x ) e C 2 , / - 0, 1,..., m , then for the convex programming problem (1 1) the 

following statements are true: 

1) for any fixed y e int Q and k > E the function F ( x. y, u. k ) is strongly convex in the 

*ighboHiood*'ifoneofthe functions /„(*) or -/<*), i - 1 * » strongly convex or efficient 

regularity conditions (1.4) are taking place and r = n ; 

2) if none of / 0 (x) and -/ (x ), / =* 1,..., r are strongly convex and r < n , but the second 

order optimality conditions (1.4) - (1.5) are fulfilled then there exist k 0 > 0 large enough that for any 
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fixed ye int Q and any fixed k> My, x')k 0 + Z u‘ there exist such that p > 0 andM < + - that 
the following is true : 

P5. a) mineigval F ,, xx {x*y,u *)c) i A 

b) maxeigval F" xt {x t y,u*^) s &' l (y,x*)M 
Proof 1) Using P4 for any v e R“ we obtain 

F" xx (x , y,u*Jt)y,y) * A-‘CK,x*)[I /, rt (x* > ii*)v,v) ♦ 

A-‘ (y,l •)(*(/'„)(* •)/ V'.f# Ov.v) - ((/'<„(* ’yf%y “<W* >. v ») 

- A <y,x ■)((! "„(* > -)v,v) - A (v,* •) (* - £«/) ((/' w (* ‘» r U;/ w (* •) v.v) 

♦ A-'O-.x'Kffi I, (/',(*■). v )*) - CVC/'.OO.v)) 2 )] 

Taking into account identity 

( £ o ( £ »; (/',(x o-v) 2 ) - (£ «,•(/> •),*)? < 4 » 

M M <■> 

. ■/, £ z«;u.‘<f <x-),yf*o 

i-i j-i 

we obtain 

(F" xx (x\y,u't)v,v) * A 1 O^.jc *)[(^ *) (4.2) 

♦ A (y.x *) (k - Eu/) (ff\ r) (x *)) r >* V )J 

So for a convex programming problem (1.1) the function F (x, y, u, k ) is convex in x for any 
y e int Cl and k a E u*. 
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If one of /o (x), -/ (x ), / = 1,..., r are strongly convex then due to u* > 0 , / * 1, ...,r the 
Classical Lagrangean L(x, u* ) is strongly convex in the neighborhood ofx* while the matrix 
(k - T.u t * ) A' 1 (y t x *) (f {r) (x *)) r U;/\ r) (x *) is non negative defined for any y € int Q and 
liE u*, therefore F(x,y, u, k) is strongly convex in the neighborhood of x*. If/ 0 ( x ) and all 
-/ ( x ) are convex then L ( x, u ) is convex in x, if addition (1.4) is satisfied and r = n, then for any 
y e int Q and k i Y. u * the matrix (*-Eu/) A' 1 (y,x * )(f {r) (x * )) T U r ‘/{ r) (x * ) is positive defined 
and again F(x,y, u, k ) is strongly convex. 

Note, due to / (x ) e C 7 , / - 0, 1,..., m the MIDF F(x,y, u\ k ) will remain strongly convex 
in x for any u e R,* close enough to u 

2) Now let's consider the case when none of f 0 (x) and -/ (x ), / = 1,..., r are strongly 
convex and r < /» . If Jb > A (y,x ') k Q + Hu‘ , then due to (4.2) we obtain 


(F""(x\y,u-*)v.v) * A -‘OvO«£ "„(*>•) 
♦ *0 VvP’yf V'Ylrfi *)]V,V) . V V6 r 


(4.3) 


Therefore if the second order optimality condition ( 1 .4) - (1 .5) are satisfied, then due to the Assertion 
1 with A =1 N m {x \u *) and B =/' (r) (x *) for * 0 > 0 large enough, any "center" y e int 0 and 
any Jfc> A (y,x*)k 0 + E»,* there exists n>0 : 

* A _l (y,x*)l*( v » v )» VveE" (4.4) 


It is also clear that for a fixed ^ e intQ and fixed k > A (y, x* ) k 0 +E u‘ there exists M <« 

(F ,, xx (x\y,u*Jc)v,v) i A' I (y,x*)A/(v,v), Vvc 1" (4.5) 
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r f 

So the condition number of the Hessian F” m ( x* y, u,k ) is fixed at the K-K-Ts pair (x*, u* ) and due 
to / (x) € C 2 it remains to be true in the neighborhood of ( x\ y' ) for any fixed "center" y e intQ 
and any fixed barrier parameter k > A {y, x’ ) k 0 +£ u' . 

Remark 1 . The second part of the theorem 1 remain true even for nonconvex problem if the 
second order optimality conditions are satisfied. In other words the barrier parameter k not only 
allows to retain the convexity in x of the MIDF F(x,y,u,k ) but also provide convexification of the 
F ( x, y, u, k ) in x in case when the Classical Lagrangean L ( x, u ) for the initial problem is not 
convex in x € R«“. 

Remark 2 . Theorem 1 holds true for the MIDF H(x,y,u,k). For any y € int Q and any fixed 
k i 0.5 k 0 A (y, x*) + E u,\ there exists p > 0 and M < <» that for V ve R* the following is true: 

a) A- 2 Cy,x*)([I /, re (x>*) 

+ * 0 (/<,)(* *)) r u;/\ r) (x *)] V,v) 2 A‘ 2 Cy,x *) p(v,v) , 

b) *y»u *^)v,v) s A' 2 O',**) A/(v,v) 


5. Modified Center Method It follows from the Theorem 1 that to solve a constrained optimization 
problem for which the second order optimality conditions are fulfilled, it is enough to find a minimizer 
for a strongly convex and smooth in* function F{x, y, u‘, k ) with any fixed y € int Q as a "center" 
and any fixed * > A (y, x* ) k 0 + E u*. Due to the strong convexity of F( x, y, u, k ) in x to find 
an approximation to x* it is enough to find a minimizer 


$ = x(y,u,k) • argminl F(x,y,u,k ) /*€ R" ) 


(5.1) 
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for a given Lagrange multipliers vector u € H_" close enough to u , when both and k are fixed. 
Moreover, as it turns out, having the minimizer £ one can find a better approximation u for the 
vector u without changing both.y e int Q and k > 0. 

Let's consider it with more details. Assuming that the minimizer £ exists, we obtain 

F&y.u.k) ■( l -*-'£«,**-'£ 2 ).£ (V £ S,/ (?) - O’ (5 2) 

where the components of the new vector of Lagrangean multipliers u"u (y,u,k) are defined by 
formulas: 

0, (y,u,k) - a, 4 (y.i) (*/<i) ♦ 4 (v.i))' 1 , 

La d, (i. y. k) - kf, (r) + 4(>. z) then for the Lagrange multipliers update we have the following 
formulas 

3, (y,u t k) - u, A (y,$) </,'* ($,>,*), jn (5 3) 

For mulas (5.3) are critical for our further considerations. 

First, we have u (u \y,k) * u * for any fixed y € int G and properly chosen k > 0, i.e. u is 

a fixed point of the map u -» Si (u,y,k) . 

Second, we will show later that for the new vector u the following estimation: 

|a -«*| rcF 1 A (y,x‘) |ir -a* | (54) 

holds, and c > 0 is independent on e int Q and k > 0, where | x | * | x I. 3 max \x t \ . 

Third, it turns out that the estimation (5.4) is taking place not only for u but for the minimizer 
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|$-JC* \*ck~ x A (y t x *) | u - u * I 


(55) 


$ as well, i.e. 


In other words, finding a minimizer 9 and updating the vector u € R„" is equivalent to applying to 
u € R_," an operator : 

C y , k C rt u = G(u.y.k)-u 

Note that C y k u = u The operator C, * is a contractive one if 

\C y k u -u* | * \C yk (u -«*)l<l« -«* I 

The contractibility of C y k is defined by 

ContrC yk = 

The constant c > 0 depends on the input data and the size of a given problem and independent 
on y and k. We will characterize the constant c > 0 in the course of proving the basic theorem. 

So, for a given problem, the contractibility 0 < y, * < 1 depends on the "center" y e int Q and the 
barrier parameter k > 0. 

The independence c on y and k makes possible to reduce Y* * > 0 to any apriori given level 
by increasing * > 0 under the fixed y , or reducing A {y, x*) under the fixed k or by changing both 
the "center" y e int Q and the barrier parameter k > 0 in the process of solution. 

In parti cular , for any "center" y € int Q and any given 0 < y < 1, one can find such a barrier 
parameter k > 0 that the operator C y k will shrink the distance between current approximation (x,u) 
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and the primal dual solution ( x*. u) by a factor 0 < y < 1. Now we will describe the basic version 
of the Modified Interior Center Method. The convergence and rate of convergence will be considered 

later. 

We start with .ye int Q ,u° = e m = ( 1.....1) e R" and k> m. Let's assume that the couple 
( x‘, u 1 ) has been found already. Take k>T u ‘ , then the next approximation ( x'* 1 , u'* 1 ) we find 
by formulas: 


x'* 1 = argmin {F(x,y,u \k)/xe R" ) (5.6) 

: u/ +l * «/ A (y,x** 1 ) d t ~ l (x** 1 ,y,k) (5 7) 

First, let us consider conditions for the problem (1.1), under which the method (5.6) - (5.7) is 
executable. 

To simplify our consideration, we assume 

Al. inf /<»(*)>- ~ 

x e R" 

We also assume that the set of optimal solutions for the problem (1 . 1) is not empty and bounded, i.e. 
A2. X * = Argmin {./£(*) lx e Q } * 0 is bounded. 

Taking into account the Corollary 20 ( see [Fiac M68] p 94) and assumptions Al - A2, we conclude 
thatthe set Q*00 - {* : kf t (x) ♦ A O',*) * 0 , M... ; *(y,x)>0) is bounded for any 
y e int Q and k > 0. Also x - d Q* 00 ■*- F(x,y,u \k) - - .therefore for any u‘ e R++", ye int Q 
and k > E u,’ the function F ( x, y, u‘, k ) is convex in x e Q k (y), and the minimizer 
x^‘ - argmin { F(x,y, u , ,k)/xe R* } 
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exists. Therefore 


F'(x-',y,u *,*) . (1 -k-'Zu,' * t-'Eu,*’ 1 )/£ <*'*') - E (*"' ) - 0 (5 8) 

and the vector u** 1 e R^“ if u' e R**“ 

Hence, starting with a vector u° € R**“ one can guarantee that the Lagrange multipliers will remain 
positive up to the end of the process without any particular care about it. 

Before discussing the convergence results we would like to describe briefly the dual interpretation 
of the MCM (5.6) - (5.7). 

Let's consider the dual function 

/»(*/) = inf {I (x, u)/xe R" } 
and the dual problem 

h (u*) - max {h (n ) / u € R*" } (5.9) 

Along with the Classical Lagrangean L(x,u) for the initial problem, we will consider an approximation 
for it 

L(x,u,u \k) *(1 -Eu t / t (x) 

Note that L ( x,it,it.k) m L (**0- We also consider an associated with L (x.u.u’.k ) approximation 
for the dual function 

h(«, u‘, it) -inf {L(x,n,ii',*)/xe R" }, 

which is equal to the dual function h (u ) when u = u‘ , i.e., h ( u‘, u \ k ) * h («' ) for any k> 0. 
Along with the dual problem (5.9) we consider the following convex programming problem 
max{h(«, u\ k)/ueK m } 

The function h ( u, u \ it ) is concave in u 6 R+" and due to (5.8) L', ( x'* 1 , u , u k ) - 0. 
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Therefore L(x**‘ , u** 1 ,u',k) m h( a ** 1 , a', k). For the subgradient of h ( a, a*, * ) at a " 1 we 
have 

dh(u‘",u‘,k) = k'f 0 (x** 1 ) e m -/( ) (5 10) 

f(x) = (/, (x ). Using formulas (5.7) for the Lagrange multipliers update, we obtain 

/(x- 1 ) » - *' 1 A (y, x** 1 ) a* ( u** x )•* + k’ x f 0 (x-* ) - k *7, O' V. (5.11) 

where a* ( a** 1 ) l = ( a/ ( a/* 1 X 1 ,..., a.' ( n.* 1 )*' ) 

So in view of (5.10) and (5. 1 1) we obtain 

aMiT 1 .#'.*)--*' 1 *(y,x'+ l )W(u'+ l y l -k- l f 0 (y)e a 


or 

^(a^'.a',*)* *•* b(y.x'")W{u'* l y l + k l f 0 (y)e m = 0 

Therefore 

« ,+l = argmaxdl («,«',*) +*' 1 Em/ [ A (y,x** 1 ) In u t (a/)' 1 

(“«V -fo(y)] /u **Z^ 


( 5 . 12 ) 


The method (5.12) has some similarities with the prox-method with eutropy-like kernel (see 
[PolTeb95]), which corresponds to MBF (see [Pol92]), however, there is a fundamental difference 
between them as well. In contrast to the prox-method, which corresponds to MBF, the dual to MCM 
is dealing not with the d 11 ** objective function h (a), but with an approximation h (a, a', k ) to h ( a). 
Therefore the convergence results for MBF method cannot be applied to (5.12). 

We will obtain the convergence results for the method (5.12) as byproduct of the correspondent 
results for MCM (5.6) - (5.7). 

These results will follow from the Basic Theorem, which we are going to prove in the next section. 
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D (*) ■ 0|(*) ® •• 0^ (•) ® •• 
u -u' 

K,y K,y *<» *’* 



6. Bilk Theorem . The Basic Theorem establishes the contractibility properties of the operator 
C y k . We will start by characterizing the domain, where the operator C y k is defined and possesses 
these properties. Let's consider a small enough number t > 0 , a fixed y 0 € int Q and a subset 
Q t = {x \f{x) 2 x e } n { jc : A Cv 0 ,x)> 0 } of theRFS Q (y). Note that due to A1 - A2 and the 
Corollary 20 (see [FiacM68]) the set Q, is bounded. We will choose the "center'' y from Q,. For any 
;eQ,we have A (y. x*) > 0. Along with t > 0 we consider a couple of small numbers t > 0 and 
6 > 0 and a large enough k 0 > 0. In the course of proving the Basic Theorem it will become clear 
what "small" and "large" mean. 

To characterize the domain, where the operator C r *is defined, we will consider two types 

of sets. 


The first type 

D t (*)={(y t u,k):u t *t t \u t -u* | sftA' l (^x *)*,*** 0 A(y,x* ) + £«,*), / = l,...,r 
is related to the active constraints. 


The second type is associated with the passive constraints. 

D i (»)={(y t u t k):0*u l i6&~ i (y,x*)k t k*k 0 &(y t x')+Eu*},i=r+l m 

The set D (•)»£),(•) x... *£>,(•) *... *£>„(•) is the domain, where the operator C yk is 


defined. 

We will prove later that for any fixed ye Q t there exists k 0 > 0 that for any k a k 0 A( y,x ) + Eu ( * 
the operator C yik is a contractive on D (•). 

For a fixed .yeQ, and a fixed k a k 0 A(.y,x*) + T, u' , the domain D (•) shrinks (see Fig. 1) to 


Uy.U - » 





We are particularly interested in the set U y k , because as soon as both the "center" y and the 
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barrier parameter k are fixed, the set U y k is the only feasible set for the Lagrange multipliers, 
moreover if C* * is a contractive operator then u e (J y k — u e CJ y i . 

Before we turn to the Basic Theorem, let's briefly describe the main idea of the proof. In 
view of A1-A2 for any ueR**" ,>»€£), and *>E u „ there exists the MIDFs minimizer x = x (y,u,k) 
and 

F, 1 (9,y,u,k) -X (*) - E 2,/(i) - *(i.J ’.».*) ♦*(*,)'.».*) - 0 (6 1) 

/-I 

where 

u t * u t A (y,$)d, l ($>y,k) , , (6 2) 

h($,y,u,k) - E a, A 0,£)</ < ' 1 (£,}' ) *)/ / (£) 
and 

g(x,y,u,k) = k~ l Eii^ -1 + A (y,x)d t l ($,y,k))j o$) 

<•1 

Considering (6.1) and (6.2 ) as a system of equations for $ and « (r) , it is easy to verify 
that x =x‘ and 0 (r) * u (y) satisfy the system for any y e Q t , k>Eu,\ and u~u\ 

Moreover for any triple (y, u, k)e D (•) , the system (6. 1) - (6.2) can be solved for x and 

Having the solution $ - 2(y. u, k ) and = u (r) = u (r) (y,u,k ) one can find the Jacobians 
$'(•) •J M ($(y,u,k)) and %,,„(•) = J u (u (r) (y,u,k)) and estimate 

I K Cv,« *,*) I and | ^ r)- ( y,u \k) | 

It turns out that under second order optimality condition, there is such k Q > 0 that for any 
y € Q t and k z k 0 A (y,x') + E u,‘ the following estimation 

tnax{ [$l(y,u*,k) | , I u[ r)u (y,u \k) | ) s, c (6 3) 
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takes place and c > 0 is independent on y and k 

Due to the continuity (•) and u^ M (•) in u the estimation (6.3) is taking place in the 
neighborhood of u* . 

In view of jc* = x(y t u*,k) and u * = u(y,u\k) and using (6.3) one can estimate 
| 2 -x * | and | u - u * | through | u - u ' | . 

The independence c > 0 on y and k makes possible to prove that for any fixed y e Q, there 
exists k 0 > 0 such that for any k * k 0 &( y,x* ) + Eu/ the operator C y k is a contractive one, i.e. 

0 < y,.*< 1. therefore u c U yk -» C yk u = u e U yk . 

In the course of proving the Basic Theorem we will assume 

min {/(**)/ / = r * 1, } = o >0, mini A (y,x*)7ye Q t } * t 0 >0 , 0 P1 be \hep * q 
zero matrix; I' be the r * r identity matrix, S(a , e ) ■ { xe R" : \x - a \ a e }. We remind that 
d t (x,y,k) 3 (kf t (x) + A (>,x))and introduce three diagonal matrices d(x,y,k)~ 

[diag d t (x,y,k) ]7-i» d {r ) * [<***« d f (x,y,k)f lm „ d {m . r) ( x,y,k ) » [ diag d t ( x t y,k ) 

Theorem 2. 1 . If A1 - A2 are taking place, then for any y € Q t , u e R** - and k > £ u* there 

exists £ * j? (y t u,k) » argmin { F(x,y,u,k) / x e R" } : F'($,y,u t k ) * 0" 

2. If « C 2 , and standard second order optimality conditions (1.4)- 

(1.5) are taking place then: 

a) for any triple (y, u. k) e D (•) the minimizer x - $ ( y,u,k ) exists, 

F;(x,y, u,k) » 0"and for the pair x and u 3 u(y,u,k) the following estimate 
maxi |£-x*| , \u - u' \ ) i ck~ l A(y,x*) \u - u* | (6.4) 

holds and c > 0 is independent on> and k . 
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b) for any fixed y e Q, and k > k 0 A(y.x') + E u' , the MIDF F(x,y,u,k) is 
strongly convex in the neighborhood of $ and for any u e U y k there exists 
£ > 0 and M < » : 

mineigval F" (x,y, u,k) 1 A ' 1 (y,x ')$ (65) 

maxeigval F" ($,y, u,k) i A “* ( y,x *)M (6 6) 

Proof 1) In view of the assumptions A1 - A2 and the Corollary 20 (see [Fiac M68] p94) the set 
Q*(y) k/^x) + A (y,x) i0 , i - A Cv,Jf)>0 } is bounded for any yeQ, and k > 

0. Also x e d Q k (y) - F(x,y, tS, k) , therefore for any yeQ„ue RJ” and k > E u, the 
function F ( x, y, u, k ) is convex in xe 0*(>») and * x(y,u,k) is an unconstrained minimizer 
of F(x, y, u, k) , i.e. F'($,y,u,k) * 0*. 

2) For technical reasons, we introduce a vector t = * t * k~ l A (y,x *) (u ( - u‘ ) 

instead of the vector of Lagrange multipliers u.then u - u — t - O m . Such transformation translates 
the neighborhood of u into the neighborhood S(o, ft ) - { / : | /, | s ft , i * 1, ...,m } of the origin 
of dual space. 

We will split the vector u on two parts, which correspond to the active and passive constraints. 

Let i) (r) = (u f , i * l,...,r ) is a vector of Lagrange multiplier, which corresponds to the active 
constraints, while Q (m . r) * Q (m . r) (x,y,t,k) » (u t (x,yj,k), i*r+l,...,«) is the vector of Lagrange 
multipliers, which corresponds to the passive contraints. 

We have ti f (x ,y,t,k) *kA~ l (y,x ')/, A (y,x) d~ l ( x,y,k ) i=r+\ m, u » (u (r) , u (m , r) ) and 

for the vector function h ( x, y, u, k ) , g ( x, y, u, k ) we will have the following replacement. 
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A(*,M»>* £ 2, (*,*/.*) «'(^)) r - . 


I*r* 1 


j£(^i A" 1 (y.**) ♦*»/)[ -1 + b{y,x)d;\x,y,k)] 

l/-i J 


(/ft*))' 


So for any Jfc > 0, small enough e 0 >Oand y*Q t the vector functions h (x,y, t, k ) and g (x, y, t, k) 
are smooth inx € S (x*, e 0 ) and t € S (0,5). Then we have h(x * y,0,k ) = 0" , g(x \y,0,k) = 0" , 
h'(x\y,Q,k) = 0T'\gUx t .y.0>k) = -A- l O\x*)/'JV)./o(**) also h f £ ^(x\y,0,k) = 0 V 
and £$ ( x\y,0,k ) = 0’ v On 1 S(x*,e 0 )x5(« ( * > e 0 )x Q t x5(0,5)x(0, +«) we consider the 
map ♦ (x,£ (r)>< y,/,*) : E 2 " +r *" + l - E" +r defined as follows: 

Qix^yxk) * {/'o'* (x) - E VTC*) " *(*»*»*•*) + £(-W’*)i 

Jfc _1 A Cv.x •) [ (*A _l Cv.jf *)', ♦ «/ ) A (y,x) d~ l (x ,y,k) -u t ] , i = 1, ...,r } 

Taking into account (1.3) and /K x *. 0, k ) * g( x*. .y, 0, k ) ~ 0" we obtain 


<&(x *,«(*) ,y,Q,k) *0" +r for any k > 0 and ^eQ,. 

Let •'a ■ *' x£ (x \u ( * *0,*) , L* » L "(x \i i *) ,/ =/(x *) X) = 4)( x *> * 

*W to v 

t/; = [ diag u/ ]%, , uj) = (»/ , i * 1, .,/•) 

In view of /» x '(x >,0,*) = O"** , h ' £ (x \* 0,*) = 0 . * x (x \y.0,*) 

(d 

= - A •* Cv.x *)/o r ( * ‘)/t(x * ) , g ' £ (x 0, *) = 0*' we obtain 


*(/.»)■ 


L" - -f'U 

-u;fn -k-'A(y.x-)I' 


Now we will prove the nondegeneracy of the matrix for any y € int fl t and any 
k i k 0 &(y,x*) + E «/. Let us consider w*(z,v)€^"^ r , then the system <D (J , *, w * 0* + r can be 
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rewritten as follows: 


I "z - A' 1 (y.x 'V'Zft z-f'L v - 0* 


(6.7) 


Z - k ~ l A (y,x *) V = C f (6 8) 

We find v from (6.8) and substitute in (6.7). Taking into account the K-K-Ts condition 
/’o = «*(,)/'(,) we obtain 

L^z * L.-'(y.x-n*fi ) ux ) - (/' ( > ( ; ) r «<;/„)]*> o- 

i.e. 

(l"z,z) ♦ A- , Cv.* , )(*(t/;^ > z.4 > z) -(a^z) 2 ] -0 

The inequality 

k i k 0 &(y,x*) + E u,‘ 


implies 

(Z "z, z) ♦ A" 1 0\x *) [*( £/,* z 2 ) - (“(*>/£) 2 )* 1 
* (Z "z,z) + A O',* *) [k 0 A Cv.x *) (t//^ z X ) * ) 

+ ( £ ) ( U r fZ) Z jZ) Z ) * ( M (r)^(r) 2 1 

= (Z"z,z) 'k^U^z.z) ♦ A^Cv.z^KEu/XEiiX.z) 2 ) -(Eu/tf.z)) 2 ] 


Due to identity (4.1) we obtain <£«/)(£«/ tf'.z) 2 ) - (Eu/ tf.z)) 2 z 0. Therefore taking into 

account Assertion 1, we have 0 ■ ((ij + 2 ) 2 ^C 2 * 2 ) » l 1 > °* i e - z * hence 

from (6.7) we obtain / ,r w v - 0, so due to (1.4) we have v - 0, i.e. <fr ( ,. *> w = 0"*' - w = (T‘ ' , i.e 
4 (y Jk) is a nonsingular matrix. 

Let k i be large enough. We consider a compact 


28 



K = {0"} x ((>,*) : y € G t , * * 0 A ( y,x *) + E u * } 

Since <Hx\ti iry y.0,k)‘0 M *' t the matrix Q y k = (x*,u (r >,0» is nonsingular, 

f t (x) e C 2 , / = l,...,m and K is compact, it follows from the second implicit function theorem 
(see [Ber82]p. 12) that there is a small enough 6 > 0 that in the neighborhood 
S(K,t>) = {(y,t,k ) : | *, | * 

/ = , y e G x , * e [* 0 A (y t x *) + E u* ,*,] lofthe compact K there exist unique continuously 

differentiable vector - functions *(•) s x(y,t,k) - (x l (y,t,k),...,x n (y,t,k)) and 

“(,)(*) = “ W (M*) = (u x (y,t,k),...,ii r (y,t,k)) such that x(y,0,k) = x‘,6 (r) (y t 0,k) = u ( ‘ r) 

and for any triple (y, t, k) e S (K 6) there is e 0 > 0 that 

max { | x(y,t,k ) - x * | , | ^ r) (yJ,k) - I ) * *<> ^ ^ 

The identity 

= *(*(•).«(,)(•).•) ■ 0 M * r (6 10 ) 

holds true for all (y, t, k) e S(K 6). 

So we obtain 

/'.'•(*(.)) - E 0 , (•)/',’ (*(•)) -*(*(•).•) ♦*(*(•).•) <«"> 

M 

■ A ■ °* 

which is the necessary optimality condition for the vector x (•) to be a minimizer of the function 
F(x, y, u, k) in x under the fixed (y, u, k ). Also from (6. 10) we obtained the identities 
fl 4 (.) ■ (JtA^CK.jr •)*,♦«/) A (yM*))d,' l (x(*),y,k) i*l r for the Lagrange multipliers 

that corresponds to the active contraints. 

After multiplying both sides by k‘ l &(y, x* ), it can be rewritten as follows: 

(/, ♦ *- , A0\x*)«/) A (y,x(*))d t ' l (x(*),y t k) -k‘ l ^(y t x *)«,(•) -0 r*l,...,r (612) 
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The Lagrange multipliers that correspond to the passive constraints , we can rewrite in the following 


way: 

«,(*(•).•) ■ £,(•) - kA' l (y,x , )t i A (y,x(*))d' l (x(*),y,k) / =r+l,...,m 


(6.13) 


let %. r) (*) = («,(•). and «(•) = (%(•). «(„-,)(•» 

To prove the sufficient optimality condition for the vector x (•) to be a minimizer of the function 
F ( x, y, u, k ) in x under fixed ( y, u, k ) we will show later that the function F (x, y, u, k) is 
strongly convex in the neighborhood of x (•) for any ( y, u, k ) e D (*). But first of all we will 
ascertain the estimation (6.4). 

To this end let us first prove that for small enough 6 > 0 and large enough k 0 there exists p > 0 such 
that the inequality 

1 s p (614 > 

holds true for all ( /, y, k ) e S ( K & )• 

We consider the matrix 






- Ur'j t 


w 


\r.r 


The matrix is nonsingular for any y e Q t . In fact, for a vector w-(z,v)eR" r the system 

*<,-> w-<T' 

can be rewritten in the following way 

L "z - A -‘(mVK* -/'m v = 0 ’ (S I5) 

< 616 > 
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Because of u\ r) ></ from (6. 16) we obtain /' (r) z * O' , i.e. (/! ,z) » 0 , /* l,.„,r, therefore 

( E u'f { , z) * (ji ,z) * 0. Multiplying (6.15) by z we obtain 
M 

(l" z,z)-A- , <y,x-)(/;,z) 2 -(vX>z)*0 (6 17) 

i.e. 

(L"z t z) *0, Vz:^ / r) z = 0 r 

so due to (1.5) we have z * 0" then from (6.14) one obtains/’ r (r) v - 0", which due to (1.4) implies 
v*(y. 

Therefore, <t> ( * w * QT' implies w - QT r for any y e Q t i.e. the matrix <P (y m) is nonsingular, so 
there exists a constant p > 0 independent of k and j'eQ, such that 

I 1 ^ Po ■ 

Hence, for the Gram matrix G^^ * $£,,.) we ^ ave mineigval G (y m) * p 0 > 0. Then there 
exists a large enough k 0 > 0 such that for any y e ft , and k t k 0 A (y,x*) + E u* i.e. k' x A( y,x * ) < 
k 0 ' 1 we obtain for the Gram matrix G^ k) * ®(y y k) inequality 

mineigval G( y ,k) * ^ l 4 * 

and Ho > 0 is independent ofye Q, and k € [ k 0 A (y,x*) + E u*, k t ]. Therefore *, is not only 
nonsingular, but there exists a constant p > 0 independent of y € Q f and k * k 0 A(y.x') + E u* such 
that 

I I - | *'^(iO,0.*);fi,Cy,0.*)) I £ p (6.18) 

The last inequality implies (6. 14) if 6 > 0 is small enough. Now we will prove estimation (6.4). First 
let us estimate the norm I _ r) ( • ) I 

Due to (6.9) for any small enough 6 > 0 there exists such small enough e 0 > 0 that for 
V(y.t,k)eS{K,&) 
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max{ l/ 0 (*(M*)) -/oO'O'.O.*)) I . I/(*(M*» “^CvA*)) | /=r+l,...,m } s e 0 . 
Therefore, in view of /, (x* ) a o > 0 for the passive constraints , we obtain 
f t {x O, /, k) ) a ^ > 0 , / = r + 1 m and for the Lagrange multipliers, that correspond to the passive 
constraints we have 

= “,(£00 _ 4(*(M*))) # (W*(M*)) + ^Cv) “Jo(*(M*)))“ l 
= “iOoCy) -/ 0 (*‘) + / 0 (**) -/ 0 (*to^))[W* •)-*«(*•) 

V 0 Cv) -/ 0 (O -<&(*(***> -/ 0 (**»r l 

* «,o^o> -/ O oo + - (* + n«o VoOo -/ 0 (**»‘ l 


* ^0500 -/ O 00 + e 0 )(/;(jr‘) - ** l (* ♦ l)e 0 +k-'(f 0 (y) -f 0 (x'))Y l . 


Hence, for small enough e 0 < — fc(J: + 1 )‘ l we obtain 




2« < (/ 0 Cv) -/ 0 (**» 


, 4 ^ Q ’)-/ o ( f 2>,,.„ 1 « 

ka 


k[o - 1 + (f 0 (y) -/ 0 (* '))k-') 

So we have 

I u (m-r)(y-‘> k ) - "(*—) I * “ i/oCv) -W* *» I "(— ) " I 

Now we will show that the estimation (6.4) holds for $(y,t,k) and Q^(y,t,k) = 

(i) (y,t,k) , i * 1 ,..., r ). To this end we differentiate the identities (6.11) and (6.12) with respect 

to/.Let */(•) - / f (2(0) -(*/,(•) J- 1-,* ).$£*,(•> “ I *- r)are 

the Jacobians of the vector functions x(*) and t^C*) . Also L(x(»), u r (*)) *Jq(x(*)) ~ 

E^(.)^(x(*)) and let J,(h(x(*) t •)) and J,(g( *(•),•)) are the Jacobians of the vector 
m 
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functions h(x (»), •) and £(*(•)»*) Then differentiating (6. 11) with respect to / we obtain: 


i=w-). %(•))*,'(•) - (/£>(*(•))%.,(•) 


(6 19 ) 


Let F (r) (ot(«))»r 




, * [diasC^ A O',**)) ft * 




« 


+ ack,*(*)) •&)(*(•). y.t)F (r) (x (*))] , 


then differentiating (6. 12) with respect to t we obtain 

T(*(»).M*)*/( # ) -*' l ACy,Jf*)^( # ) " 

( 620 ) 

- (A Cv,jf(*)) • ; o r ' m "] ■ $(*(•).•) 

Now we consider Jacobians J t (h(x( •),•)) and ./,(£(*(•)* •)) *® morc detail. 

Recall that 

h(x(y,t,k) t y,t,k)*h(x(»\»)* £ 6 < (ar(*),»)(/f(Jt(*))) r *(u (ll ,. r) (Jc(«),*) < /J,. r) (Jc(*))) r 

/»r+ 1 

Therefore 
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= [ E $,(*(•).•) </"(*(•)) + / / i-o( JC (*)))^-rXx( Jt ( # ).*) ]*/(•) + 

/ *r ♦ 1 

/£,>(*(•)) [ O*-'-' ; *4-‘(v,x ")A O. ( * ).>. * )I - 

JV(jcC*X-)*/<*) * 9(xC-X •) 

Taking into account Jt(y,0,*) - x * , u ( (x \y,0,k) * u/ * 0 , i = r* 1 

we obtain = 80(1 <i( x ‘>y > Q > k ) = 

f’l .„(*•) [O’ ; M, •,>’,*)] 

Now let us consider the Jacobian 

«J,(*(*( # XO) ■f.O'OX •)*/(•) ♦ */(*(*X # ) 

= & / (^(O.oV(O + A' l 0»,jtO/ r /(^(O)[^WJf(*))(^(*)) + A*‘Cv,x(.))]7 

*«*(*(•)» •)*/(•) + p(*(*x # ) 

For the Jacobian g' t (x (• ),• ) we obtain 

&'(*(«XO -*’ 1 [ £ (fyA- l to*(0) *0(-l + 

(-1 

+ *' 1 /o r (*(0) [ £ (JtyA^CM*) ♦ «,*) * (-Jo(x(*)H'\x(*) t y,k) 

M 

- AO',jc(OH‘ 2 (Jf(*).y,*)(*/(*(*)) -Jo (*(•)))] - £(*(•). •) 

Taking into account (1.3) and x(y,Q,k ) * Jf * , [ diag {kt t A l Cy,x *) ♦ ) 3i/ r «o 3 • 


34 



F. - FJx(0.y,k)) - m 

G(x\y,0,k) =■ - 4 •'O.x •)/ 0 r (* ■)/*(*'). 
p(x‘,0,y,k) » 4 -'(y.x •)(0»' , F ( '.„ [ diag k/,(x •) d?(x',y,k)T,.\ ] 

Therefore, 

J,(g(x ’.y.O.*)) - 4 -• (y.x -)fl(x -)jl(x •Jj/O-.O.*) * p( x ’.O.y.k) 
T(i >. 0 ,*) - - O'*-' ] 

?(* >.0,t) - k ■' 1 0" ; /£_„ (JT •) <.„(X '.**) 1 

We recall that 

•),%)(•),•) ■ 3 

/* r 

£=(*(•).&(,,(•)) ♦G(*(*). •) -JWOr(-).-) 

*(*(•)••) -i" 1 &(y,x')I r 




Then combining (6.19) and (6.20) we obtain 


rn 

n */(•) -(♦'-(•))‘ l "| *(*(*).•) -/>(*(•),•) (*))' 1 *(*(•).*)• 
r S' (.) rl «■*(*).•) J 
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Now we consider the system (6 21) for t-0". Taking into account 


£=(*(>. G(x(y,0,k),y,0,k) 

* - (f a (y) -f„(x •))-'/ 0 r (x -)/Z(x *), N(,x(y.0My.0.k) - 0** , 

T (x(y,0,i) ',y,0,k) - - u'f lr) (x •) , j(xCv.O,*) ,y,0,k) • q(x \y,0,k) 

■ k-' [0"-' ;/£ _„(x •)</„'_, ) (x >,*) ] ,p(x(y,0.k),y.0,k) ~p(x‘,y,0,k) • 
[ »' r i „(* •) [ dia tf,(.x •)</,-' (x >,*)£, ] . 
S(xCv,0,*),>,0,*) - S(x \y,Q,k) •(-/'; O'**' ) 


we obtain the following system 




= * 


-l 

O'.*) 


-p{x \y,0,k) 
S(x\y, 0,k) 


<„.»(X>, 0,*). 


( 6 . 22 ) 


Therefore 

aux ( | x/0-,0,*) | . | % r) ,,(y,0,k) ■>< I *£ t) 1 1 R(x \y.0,k) t . (6 23) 

Taking into account min { A( < y,x*)/ y « Q ,} * t 0 > 0, 

I [ diag (f,(x •) ♦ k A Cv.x •))> £, U o ' , 

I [ di«« (f,(x ’W,(x •) * k ■' 4 Cv.x •))-' r.i 1 1 1 . 


one obtains 
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| ? (x •,/. 0.t) 1 1 o ‘ l/'£.„(* •) I . \p(x \y,0.k) 1 1 1 0 ' | F'. r) (x *) I , 

I SC* ‘,0,y,k) | s 1 and I R(x‘,y.O,k) | s o' 1 l/' ( «. r) 0 ■) I ♦ 'o' I F i.-,)(x ') I * 1 

In view of (6. 18) and (6.23) we have 

max ( | x'(y, 0,k) | , | & lrX ,(y,0,k) | } s p ( 1 + o' 1 \f'[ m _ r) (x *) | + t 0 ‘ | F^. r) (x *) I ) * c 0 

So there exists a small enough 6 > 0 such that for any (y, t, k ) € 5 (K, 6 ) the inequality 

|(*'$ ( x(y,at,k),Q, r) (y,at,k)-,y,at,k)y l R(x<y,at,k),y,at,k ) \ 

w (6.24) 

s 2p( l ♦o’ 1 1 /£.„(* •) | * t 0 " 1 F ( „. r) (* •) I ) - c 0 

holds true for any 0 s a s 1. Also we have 

x(y,t,k ) -x(y, 0,*) 

&,t y.U) -O,(y,0.k) 

[° « «•*)» “mCV. « ».*).>. «*.*)• R(x(y,*t.k) ,y,at,k)[:]drx . 

From (6.24) and (6.2S) we obtain 

maxtlx'O'.a)-** \,i%r)(y>t'k)- u ir)V*c 0 \t\‘ c 0 k- l My,x •)!«-«* I 
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Let x (y,u,k) -xO\** l A ( y,x *)(« ‘ ),k),u(y,u,k) * (tf (r) Cv,*' 1 A (y,x *)(« -«*).*), 

-a*),*)) and c = max{c 0 ,4o' 1 }, then 
max { \x(y,u,k) -x * | , | u(y,u,k) - a * | } s ck‘ x ACv.x*) | a - a* | 

So we ascertained the estimation (6.4). Also x(y,u *,k) * x * and u(y,u \k) = a * foUows from 
(6.4) for any triple ( y,u,k)eD (•) i.e. u is the fixed point of the mapping a - u(y,u,k) 

3) Now we will prove that F ( x, y, u, k ) is strongly convex in a neighborhood of 
$ = $(y,u,k) for any (y,u,k) e D(*) . 

Using the formula for F"&y t u,k) (see Appendix A2 ) and taking into account the estimation (6.4) 
we obtain for a small enough 6 and for any triple (y, a, k)e D (• ) that 

F"(i.y.u,k) - A -'tv.* ■)[£"(* \u •) * A-'Cv.JC-) (*/£,(* ') -fl(* - )) 

For any triple (y, a, k) e D (• ) we have k ik 0 A (>\x*) + E a, , and 

Jk' 1 |a, - a/ | s 6 A _1 (>,x*)»^ *1, •••,»* .Keeping in mind min {A (y, x*) | >> e 0 1 } = t 0 >0 for 
any ve R" we obtain 


( F^($,y,u,k) v,v)iA (y,x *)[((!«(* *» M *) * *b fvpr *^ v,v ^ 
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- (*A (y,x •))■* £ I | (/V* *). v) 2 -k' l Z \u* -uj (/£'(* *) v, v) 

- ftmT 0 ' l (to’ l (/o(* ‘Xv) 2 ♦ (/£'(* *)v,v))]. 

So due to Assertion 1, there exists a large enough such that 

(F"($,y,u,k)v,v) a A' l Cv,x*)[ji(v,v) - &to 1 m(t 0 1 (/o(jr*),v) 2 + (Jo'(x *)v,v))]. 

So for small enough 5 > 0 and any triple (y,u,k)eD(») there exists 0 < ft < ji: 
F"($,y t u,k)v t y) a A' l (y,jr*)iUv,v), VveB* 
i.e. for V ( y, u, k ) e D (• ), We have 

mineigval F^($,y,u,k) a A " l (y,x *)ft 
To complete the proof we note that for any triple (y, u, k ) e D (• ) we have 

* Em, * (£«,* ) ( k 0 A ( y,x *) ♦ Eli /)' 1 +6wA' l (y,s *)* (Eii/)(*b t 0 + Em/)' 1 + * m T o 1 

Therefore if 0 < 6 < — then k _1 E u t < 1 .So for small enough fl > 0 the 

m(l +(Eu,*)(*b T o)' i ) 

function F ( x, y. u, k ) is convex in x e Q* (y ) for any (y, u, k) e D (• ). Hence the vector 
£ = £(y,u,k) is a unique minimum of the function F ( x, y, u, k ) in Q* (y) and F'($,y,u,k) = 0 
Due to the definition of F ( x, y, u, k ) we obtain $ * argmin { F(x,y,u,k) \ x e B* }. 

U sing the formula for ,k)on& can find A/ such that for any triple (y, u, k) e D (•) 

the estimate (6.6) is taking place. 

We completed the proof of the basic theorem. 

Remark 3 All statements of Theorem 2 remain true for the MIDF H ( x, y, u, k ). To prove it 
we consider instead of ® (x,Q^yy,t,k) the mapping ® u (x,ft^yy,t,k ) : B 2 " * r * m 1 - B" 
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defined by 


s (f'fa) - E »if /T t (x) - i 


A v/f/ 


Cl 


*-» A (y,x *) [(*A 0,Jf *)', + O A 2 O',*) d t \x,y,k) l,...,r) 


where 


h(x,y,t,k )* E u < (x,.y,f,*)(/ / (x)) r . 


(•/- ♦! 


^(ar,y,/,*)«ir , {E(*/ / A' I Cv,x*) + «i*)[ -1 + A 2 (*x)d/ 2 (x,M)]>Oo(*)) r 




and 

* *A _2 (x *) t ( A 2 0\x) d?{x,y % k ) , i - r ♦ 1 m 

The MIDFs and MBFs [ see(Pol 92)] have some common features, however, there are essential 
differences between them as well. 

We will consider few small examples to illustrate some of the differences. 

7. Examples. Let us consider a convex programming problem 

x* • argmin 0£(x) m x |/(x) = -x 2 + x * 0} * 0 (7.1) 

The corresponding Classical Lagrangian is L(x,u) - x - u( - x 2 + x) , then L'(x *,«*) = 

1 +2u*x* - «* *0, i.e. u* » 1 . The feasible set Q = (x: -x 2 + x * 0 } * [ 0, 1 ] .Now 
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we fixed 0 <y < 1, then the following problem 

x* = argmin ( - InCv -*) | *'‘[ ln(*(- x 2 + x) + y-x) -1n(y -*)] *0,.y>x} 
is equivalent to (7. 1 ) and the corresponding MIDF is: 

F(x,y,u,k) * - \n(y -x) - k' l u[]n(k(- x 2 +x) *y -x) - InCv -x)] 

= (- 1 ♦ «*' l )lnCy -x) -k~ l u]n(k(-x 2 +x) + (y-x)) 

So F(x,y,u \k) ■ ( - 1 -*■ it _I )ln(.y - x) - k~ l \n(-kx 2 + (k - 1 )x ♦ y) and 

F'(x,y,u*,k) = (1 -k~ x ){y -x)" 1 + (2x - 1 + *' l )(-*x 2 +kx +y -x)* 1 , therefore 

F x ( x*. y, u,k) = 0. Let us consider F" a ( x, y, u, k ). We obtain F n a ( x, y. u\ k ) - 

( 1 -k’ l )(y -x)* 2 *(2kx 2 + 2(y -x) +4x - 2*x +(*- l^Jr^K - kx 2 *(k - 1 )x +y)~ 2 . 

Therefore F^(x *,y,u \k) • (2y + * - 1 )y ~ 2 and taking^ « Vb and * - V4 we obtain 

F" m (x* ;%;«*, Vi ) - - 16, i.e. the Classical Lagrangian for the equivalent problem is strongly 

concave at the solution, while problem (1.7) is convex and the Classical Lagrangian L (x, «*) * x 2 for 

problem (1.7) is strongly convex at the solution x* ■ 0 * argmin L (x, «*). 

Moreover, F(x ,i,ii *,-) » ln(- -x) - 2 ln( -x 2 -x + -j) + 2 In 2, so for x = 1 - e 
8 2 8 4 ® 

we obtain /*’(— - e, ) » Ine - 21n( -(-j - e) 2- (-^ _ e) + -j) + 21n2 * 

8 8 2 S 8 4 

In e -21n(—e - e 2 +—)+21n2*F(e). Therefore inf F(x,-i »\ -i)*infF(e) * - «. 
4 64 Oiiii 8 2 f -o 

This example shows that without the condition k>T,u * Theorem 1 and Proposition 3 are invalid 
even for the convex programming problem. 

In case of Modified Barrier Function the situation is different. For the MBF which corresponds to 
(7.1) we obtain F(x,«,*) - x - ft* 1 *ln(*( -x 2 + x) + 1 ) then F(x \» %*) »/ 0 (x *) * x * = 0 
Fg(x\u*,k) ■ 0 and F"(x*,m\*)>0,x* - argmin (F(x, «\*)| xe R l } for any*>0 i.e. 
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the Proposition and the first part of Theorem 1 remain true in the case of the MBF for any k > 0 if 
the problem is convex. 

As for the MIDF F(x,y,u,k) then the Proposition 3 and the first part of Theorem 1 are true only 
if it > E u*. 

As far as the Theorem 2 is concerned then the results for the MBF remain true for MIDF only if 
k i k 0 tk{y,x' ) + E u*, however instead of the estimation 

max{ |, \)*ck' x |«-«*| (7 2) 

(see [Pol92] pi 85 ) for the MBF, we obtain the estimation (6.4) for MIDF, i.e. 

max( -x* |, |d - if * \}&ck' x ACv,**) |w-m*|. 

Therefore one can improve the convergence as compared to MBF method by choosing a fixed 
ye Q t that A (y,x*) < 1. Now we will show that it is possible for all statements of Theorem 1 and 
Theorem 2 to remain true, even when the functions /X x ) , i ■ 1,..., m are non-concave. 

To show this we consider the following problem: 

x * = argmin {f Q (x) * x L/(x) * e x - 1 a 0) ■ argmin ix \ -e x + 1 s 0 } = 0 (7 3) 

The function /(x) * - e* + 1 is strongly concave, therefore, the Classical Lagrangian 
L(x, «)«x + tf(-e* + 1) is strongly concave for any « >0. ThenX, / (x*,ii) »0^ 1 -n *0 i.e. 
ii*»l,soI(x,ii*)*x-e* + l and infl(x,« *) - - moreover infZ(x,i/) =• - « for any u>0. 
Now let us consider the MIDF, which corresponds to problem (7.3). We obtain F (x, y,u,k ) = 

- lnCK -x) - k _I n(!n(*(« * - 1 ) +> - x ) - ln(y - x)) » ( - 1 +k tOlnCv -x ) -i k ~ l i u 

\n(k(e x -\)+y-x).SoF(x,y.u\k)-(-\+k- l )\&(y-x)-k- l \n(k(e x -\)+y-x), 
F&y.u'M^il-k-'W-xy 1 -(e*-k- l )(k(e x -\)+y-xy l 9ndF;{x\y t u\k)=0 

foranyj'>0and£>0. Then F^(x t y,u * ,k) » (1 *)(, y~x)**(.(k~2) g ~(y~x)e +k ) 


42 



(*(**_ 1 ) +<K -,)*)-» and F"(x\y.u *,*)«(* -y - 1 ^.Therefore for any yi t>Oand 
any k>y + 1 the MIDF F(x,y,u\k) is strongly convex in a neighborhood of the solution 
x* - argmin {F( x, y, u, k ) \y > x } » 0 while the Classical Lagrangian L (x, a*) = x - e* + 1 is 
strongly concave in the same neighborhood and min L(x,u *)*”*• 

X 

Now we would like to make a few comments about the estimate (6.4) and its relation with the 
corresponding estimate (7.2) for the MBF. Let us consider the matrix 4> ( ,. *, for problem (7 3). We 
have 




(**) 



» 

-1 -y-' -1 

-1 - yk ’* 

L ^ 4 

, so Qfai) * 



k-y-l y{k-y- 1)J 


It is easy to see that q (x*. y, 0, k) m p(x*,y,Q,k) m Q and 


R(x\y, 0,k) 


q(x\y,0,k)-p(x\y,0 t k) 

S(x\y, 0,k) 


0 

- 1 ! 


Therefore, taking into account (6.22) we obtain 




> 

k 

S'O'.O,*) 

0,(y.o,k) 

1 « 

- <»>*(* '..v.O.*) * 

k-y-l 

*0-1) 


y(k-y- 1). 


For all (y, u, k ) e D ( • ) there exists 

$ y*u,k) * argmin { -lnty-x) -k _1 ii(ln(*(e *- 1) +y -x) - InCy-x)) \y>x) 
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Also for $ and Q - 0(y t u,k) - (y-$)(k(e s - 1) *y - V and t~y\u-u* |*‘ l smallmough 
we have |$ - x* | i 2k(k-y-l)' l t, | u-u* | s 2 *Ck + 1)Cv(*-^- 1»' 1 /, Le the following 
estimations 

|$-x* | ( 74 > 

1 *> - m * | i2(y*\)(k-y-iy l \u-u f \ (7.5) 

holds true. 

Now we apply the MBF to the same problem (7.3). We obtain F(xji£) *x ~k *11 ln(t(e * - 1 ) + 1) . 
So F(x \u \k) »/ 0 (* *) =0 , F'(x,u *Jt) = 1 -e*(k(e * - 1 ) + 1 ) * , F x '(x > \k) - 0, 
F"{x,u,k) - (* - 1 )ue x (*e * - * ♦ 1 )' 2 and F" (x > *Jc) - * - 1 .Note that for any * > 1 the 
MBF F(x,u,k) is strongly convex at the solution while the Classical Lagrangjan L(x,u*) m 
x - e* + 1 is strongly concave. Then (see [Pol92] pi 85) 
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Due to Theorem 1 ( see [Pol92 ]) for all («, k) e D (• ) there exists $»$(i<,*)» 
argmin{F(x,i/,Jt) |xeA *} and for $ and 0 • ii(*(e*-l) ♦ l)' 1 the foUowing estimations 
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(7.6) 


|$-x*| i2(k-iy l | m - 1 / * | , 

| 0-u* \ i2(k-iy l |i /-«* | (7.7) 

holds true. Comparing the estimation (7.4) and (7.6) we find that the extra tool in the MIDF has an 
essential influence on the rate of convergence for the primal sequence. By changing the ’’center" y it 
is possible to speed up the convergence of the primal sequence even with fixed parameter k > 0. 


8. Shifted Interior Distance Function fSIDF) The SIDF one obtains from the MIDF by setting 
u = e = (1,...,1) € R+" , i.e. the SIDF, which corresponds to F(x,y,u,k ) is defined by formula: 


V(x,y,k) = F(x,y,e,k) =(-l +k- l m)]n(f 0 (y)-f 0 (x)) - k ~ l E \n(k/ t (x) */ 0 (y) ~/ 0 (x)) 

( 8 . 1 ) 


rf-i 


= (-1 +k~ l m)\n&(y,x)-k~ l T, \nd t (x,y,k) 

<-i 


If A1 and A2 are taking place then the set Q t (y ) is bounded For any k > m and y e Q T SIDF 
<p (x, y, k ) is convex. 

Therefore there exists 

*(•) * JrO.^) ~ argmm { <p (ar,^, A:> /.re R* > 
and 

- 0 (8 2 ) 

i»l 

By setting 

«,(•) * u,(y,k) = A Cy,jr(*))</ < ‘ l (x(*), •),/ = !,...,/» (8 3) 

and «(•) = («,(•) , i = 1 m) we can rewrite (8.2) as follows: 

- o’ (8 4) 

/■l 
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The following proposition is taking place. 

Proposition 5 . If conditions A1 and A2 are satisfied, then for any monotone increasing sequence 


{ k, } : k, > m , lim k, = » and y e Q t : 

1) the sequence {w (y,k t ) = ( x (y, k, ), u (y, k, ))} is bounded and any limit point (x, u) of 
{*'0'.*,)} isaK-K-Ts pair 

L'(x t u) - 0 , uj^x) = 0 , u t *Q , i = l,...,m , 


i.e. x = x * , u = u * 

2) lim <f>(x(y,k t ),y > k t ) = q»* - ln(/ r 0 O') -/ 0 (* *» _1 *«■ / 0 (*‘) “/oOO " «P("<P*) 

Proof Due to the Corollary 20 (see [FiacM68]p94) it follows from assumptions A1-A2 that the set 
(y) is bounded, therefore in view of the inclusions Q k (y) Q^iy) => ...=> Q^Cv) 3 0*^00— 
the sequence (x (y, k, )} is bounded for any y e Q r So it contains a converging subsequence We 
can assume without loosing generality that 

lint x(y,* t ) ** 

It is clear that x depends on^ € int Q , we will omit this indication to simplify notations. 

Then lim <p(*(*)»*) = It® ty(*(y*k t ),y>k t ) = lim [(-1 +mk t 1 ) ln(/J 00 -/j (*(•))) 

- £ in (/(*(•)) * (A<y) -/ 0 (*(O))*;') ♦ to *, 1 

• lim[-lnOSOr)-/tx(.)))-t;‘ Eln/(*(.))] (85) 

|.. 1-1 

Therefore x e 0 , moreover x 6 d Q, i.e. there is at least one index t :/(x) = 0, otherwise we would 
be able to find A > 0: 

f t (x(y>k,)) ^ ^ “ 1 

for any large enough k,. 
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Therefore using formula (8.3) for the Lagrange multipliers we obtain 

lim u t (y,k t ) - 0 , / = 1 

V - 

So, from (8.4) we have /£(x) - 0 xc Q which is impossible, because (1.1) is a constrained 
optimization problem. 

Hence, x 6 dQ and we will assume that 7 = { t :/ t (x) = 0 ) is the active set. 

Now we want to prove that {«(•) = « (y, k,) } is a bounded sequence. Assuming the opposite we 

m 

can find /:«,(•)- O'. *,) - 00 Dividing both sides of (8.4) by E w, ( •) we obtain 

/•I 

( z: «,(•))■'.£«•)) - £ ♦*;*(• -»(E» l (*)r , )^(*(o) - o* <8 6) 

i-1 M 

where */,(•) = tt i (*)(E« j (*))' 1 ,/ = 1 ,•••,« 

Taking a limit in (8.6) we obtain 

E Hji (x) = 0 and u ( * 0 , / e 7 (8 7) 

_ 

and not all u t - 0. 

However, (8.7) is impossible because it contradicts Slater condition (1.2), so {u (y, k , )} is bounded. 

Without losing generality we can assume 

u = lim u(y,k t ) 

K* m 

Taking the limit in (8.4) we obtain 

/J(x) -E«j/ / (x) = 0 and u t f t (x) = 0,/ = l,...,m 
In view of u 6 K? and x e Q we have x = x * , u = u * i.e. (x,u) is a K-K-Ts pair. 

To find lim <p (xO',k # ), < y,Jt # ) we consider first the passive contraints / :/(x)> 0. It is clear 

V- 

that for y € Q t 

lim - o , i i / (*•*) 
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Now we are going to consider the active constraints keeping in mind the formulas for the Lagrange 
multipliers (5.7). We have 


0 z u, = lim 


/oOO -/o(*(*)) 


— *,/;(*(•)) +/ 0 (y) -/o(^(0) 


iel 


For > 0 we obtain ^(*(*)) *0 (it/ 1 ), for u t = 0 we have lira*,/(*(*)) = therefore 
for any y € Q , 

limit / 1 ln/^(*(*)) -0 ,iel (8.9) 

Using ( 8 . 6 ) and taking into account ( 8 . 8 ) and (8.9) for any y e Q T we obtain 

9 * = lim 9(*0'.* f X>'.*,) * li»[ "In (^00 -fo( x (y> k J) “*» * E 

u7 

= lim [ -ln(/ 0 Cv) -f 0 (x(y,k t ))] * In [f 0 (y) -/ 0 (* *)]-* . Therefore f 0 (x *) =/ 0 Cv) - e 

I- — 

If for the problem (1.1) the standard second order optimality conditions are satisfied then the 
following statement, which is a corollary of the Basic Theorem, is taking place. 

Assertion 2 If the second order optimality conditions (1.4) - (1.5) are satisfied, then there exists 
k 0 > 0 such that for any yeQ , and k i k 0 A(y,x*) + E u* > m the following statements are taking 
place. 

1 ) there exists a vector 

x(y,k) ■ argmin { 9 ( x,y,k )/ x e R" } 
such that 9 ' (x(y,k),y,k) » 0 *. 

2 ) for the pair of vectors x(y,fc) , u{y t k) ■ (u t (y,k) * A (y,x(y,k))d, 1 (x(y,k),y t k) , i = 1 m) 

the following estimation 

maxi | jeO.Jfc) - x * \,\u(y,k)-u* |} s ck' 1 ACv,**) ( 8 . 10 ) 
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holds and c > 0 is independent on k and y. 

3) the shifted interior distance function <p (x, y.k ) is strongly convex in a neighborhood of x(y,k ). 


9. Numerical Realization of the MCM The numerical realization of the MCM requires: 

1) to find a "center" y e int Q , ; 

2) to find a triple (y, u, k) e D (• ) ; 

3) to replace the infinite procedure of finding $ = $(y,u,k) for a finite procedure, which retains the 
contractibility properties of the operator C y k . 

To find> € int Q T we consider the following convex programming problem 

* / = I.* > m) ( 91 ) 

Starting with a "warm" start (x°,x^ { ) : r®, <min{/(jf°)/l thin) we apply any IPM for 
solving (9.1) up to the point (x, x Myl ) then we set y = x . 

To find a triple (y.u.k) e D (• ) one can again apply an EPM, starting with* =y,u=e m 
and k> m and increasing the barrier parameter followed by a Newton step for solving the system 
V' x (x,y, £) = 0" inx. 

The other option is to find an approximation for a vector x(y, k) .y' x (x (y, k),y, k) =0", 
using smooth unconstrained optimization techniques. In particular, one can use Newton method with 
step length, which we will describe later. 

Due to Proposition 5 or Assertion 2 to find ( y, u, k ) e D (• ) it is enough to find an approxi- 
mation for x ( y, k ) and u ( y, k ) when k>m is large enough. 

Also the parameter k > m has to be large enough to guarantee that C y k is a contractive 
operator for a given y e Q „ i.e. 0 < y* * = c k ’* A( - y, x*)< 1 . The constant c > 0 is associated with 
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the input data and the size of the problem and apriori unknown as well as the value A(.y,x*) for a 
chosen y e 0 ,. 

So there is no explicit way to find for a given 0 < y < 1 a "center" ye Q, and k > 0 such that 
y < ck' 1 b(y, x * ) < 1. 

Therefore we have to find an implicit way to adjust y e Q t and k > 0 to an apriori chosen 
0 < y < 1 to guarantee the inequality y < ck' 1 &(y,x')< 1. 

First, let's consider for a fixed y e Q , and k > E u, , a non negative function v(w,y,k) = 
v(x,y,u,k): W yk =Q k (y)* U yi * - R, 1 , which is defined by formula 

v(w,.y,jfc) * max { | F'(x,y,u,k) | , max { -f t (x ) } , E u t \f t (x) \ ) 

1 itiM 1-1 

The following proposition is a consequence of the K-K-Ts optimality conditions. 
Proposition 6 . For a convex programming problem (1.1), any fixed y € Q„ ue R+" and k>H u, the 
following statement is true 

v(x,y,u,k) = 0 — (*,«) = (x *,«*) =w* (9.2) 

Proof . First, v(x,y t u,k) 3 0 max { ~f t (x) / 1 sliali 0 , i.e. -f t (x) s 0 or 
/(x) i 0, / = l,...,m, sox e 0 

Second, v(x, y, u, k) * 0 — Ei/, [ f t (x) | =0, i.e. u t | f t (x) \ -0, i= 

i-i 

Therefore for any 1 s i i m 

u t >0 -f t (x) * 0 andyj(x)>0 -•> u t = 0 (9 3) 

Third, v(x,y,u,k) = 0 - | J r x / (x,}',w,*) | = 0 
Therefore in view of (9.3) we obtain 
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F'(x,y,u,k) 


l-k' l Zu t , A «, L/f(* )-*~ Vo(*)] 
/«(y)-/o(*> J5( * ) m W t (x)+f Q (y)-f 0 (x) 


= A- 1 CK,Jc)(^(af)-E« < /(x))=0- and A(y,x)>0 


« 

Hence, v (x, y, u, k ) = 0 leads to x e Q and L*(x,u) = 0" , E u^ix) * 0 , u € R + , so 

i-i 

w = (x, u) = (x*. u* ) = w* . 

Let's consider the second part of the statement. We have to prove that v (x*. y, u,k) = 0. 
First, x* € Q , therefore max { -/ (x* ) / 1 i / i m } s 0, then in view of (x\ u ) is a K-K-Ts 
pair, we have Lu ( * |/(x*) | = E u’ f (x* ) = 0 and due to the Proposition 2 F x ( x*. y, u,k) = 

A' 1 (y, x * ) L, ( x*, «* ) * 0* . Therefore v ( x*, y, u,k)- 0. 

m 

For a fixed "center" y e Q T and a fixed barrier parameter k> E if/ one can consider the 

/■I 

non-negative function v(w,y,k) : W yk = Q k (y) * U yk - *! as a merit function, which measures 

the proximity from w = ( x, u ) to w* = ( x*. u ). 

Due to the smoothness of f, (x ), / = 0, and boundness of W y k there is a constant L that 

v(w,y,k) = v(w,y,k) - v(w \y,k) sl|w-w*|. v w c w y ,k ( 9 4 ) 

Without loosing the generality we can assume that Li 1 . 

Further, if for a given 0 < y < 1 the "center" y € Q t and the barrier parameter k > 0 are 
chosen appropriately, i.e. y**- c**' A(* x* ) < y < 1 then, due to (6.4) for any u e U yk we have 
max< |$-**|. y |« -«*l 

and again u e U yk . 

Let's assume that for u° e U yk we have | u 0 - u * | i 1 , then MCM (5.6) - (5.7) produces 
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a sequence { w* = ( x 1 , u 0} : 

max{ | jc *-x* |, \u ' -u* \ ) i y‘ (9 5) 

Then in view of (9.4) we obtain 

v(w \y,k) = v(w \y,k) - v(w \y,k) s y' (9 6) 

In other words if for a given 0 < y < 1 the "center" and the barrier parameter are such, that 
y y k s y , then { y' ^ is a majorant for the sequence { v(w \y,k) IJ.i 

We will say that the pair (y, k ) is consistent with a given ratio 0 < y < 1 if (9.6) is taking 
place for all s z. 1 . 

To check the consistency of a chosen pair (y, k ) for a given ratio 0 < y < 1 one has to solve 
infinite number of unconstrained optimization problems. Moreover, each problem requires infinite 
number of arithmetic operations. 

So, first we will show how to keep the estimation (6.4) without solving an unconstrained opti- 
mization problem at every step. 

Then we will show that the numerical realization of MCM does not require the consistency 
of chosen couple (y,k) with a given ratio 0 < y < 1 from the beginning of the process. We will 
achieve the consistency by adopting the barrier parameter in the process of solution. 

Let's consider a number 0 >0, k>T.u, and a pair (x,y). 

I F'(x t y,u,k) | s 0 k ~ l \ A (y,x ) d~ { ( x t y,k ) u-u\ (9 7) 

We update the Lagrange multipliers by the following formula 

u ^ A(y,x)d’ 1 (x,y,k)u (9 8) 


The following assertion is taking place. 
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Assertion 2 If the second order optimality conditions (1.4) - (1-5) are satisfied then for any p > 0 
and y € Q t there exists such k 0 > 0 that for any ( y, u, k ) € D (• ) the following estimate holds 

max( | jc — jc * | , | « - «* | > ^ cA: 1 A 0 ,jc*)(1 +0) | « - m * | (9.9) 

The estimate (9.9) can be proved by a slight modification of the considerations, which have been use 
to prove Lemma 2 ( see[Pol92]). 

So the co nsist ency of a couple (y, k ) with a given ratio 0 < y < 1 one can check using instead 
of (9.6) the following inequality 

v(w f ,}\it) sy' , iil (9. 10) 

where w* = (jc*, u* ): 

lF^x-',y,u‘,k) | s 8 * '■ | A Cv,*" 1 ) d(x‘' 1 ,y,k) u' - u' | (9.11) 

5'*‘-AC y,x , ' , )d-'(x"',y,k)Z' (9 12) 

Due to the Assertion 2 for any fixed "center" and a given ratio 0 < y < 1 there exists a threshold 
ky yikt that for any k i k y r we have ck ~ x A (y,x *)( 1 + 0) s y < 1 u € U y ,k “* u € U y ,k 
In other words, for k z k y r > k 0 and u e U y k : 

1) the approximation x that satisfies (9.7) can be found for a finite number of operations; 

2) after every Lagrange multipliers update, the distance from the approximation x and u to 
x* and u shrinks by a factor 0 < y < 1; 

3) the new vector Lagrange multipliers u belongs to U y k again. 

So, the inequality (9. 10) holds for any 5 * 1, i.e. the fixed couple (y, k ) is consistent with a fixed 
given ratio 0 < y < 1. 

We would like to emphasize that the convergence of the method (9. 1 1) - (9. 12) is not due to 
the "center" or barrier parameter update, but rather due to the Lagrange multipliers update. 
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Therefore when it comes to numerical realization of the MCM (5.6) - (5.7) the main problem 
is to find for a chosen fixed "center" y e Q T and a given ratio 0 < y < 1 the threshold k y r > k 0 that 
for any k z k y r the couple ( y, k ) will be consistent with 0 < y < 1 

In the procedure describe below the "merit" function v ( w, y, k ) is the key element, which 
we are going to use to adjust the barrier parameter to the level, which will make ( y, k ) consistent 
with 0 < y < 1 while the "center" y e 0 1 and the ratio 0 < y < 1 are fixed. 

To describe the MCM we introduce the Relaxation Operator R : Q* (y ) - Q* ( y ) by formula 

Rx = x 

where x is defined by (9.7). 

We start with a "center" yeQ T , we set x° = y and We choose the ratio 0 < y < 1 

and a monotone increasing sequence { k t } : k^ > m , k t < k^ , lim k, = » Let e > 0 as the required 

accuracy. 

Let’s assume that the pair ( x\ u‘ ) and the barrier parameter k - k g have been found already. 
The next approximation ( x * , « ,+l ) and we find using the following operations. 

1) x : -x 1 , u : * 

2) x : =Rx, &(y>x)(T l (x,y,k)u ; 

3) if v(w,y,k) £ t , then x * = x , u * = u and stop; 

if v(w,y,k)>t 

4) a) if v(w,y,k) £ y 1 * 1 and *>£«,, then -k t x‘* x =x, u'* 1 =«, 
s+ 1: -s and go to 1. 

b) if v(w,y,k) £ y 1 * 1 , but * * E u, , then k : = k g +1 * E u t + 1 , 
x' +1 = )y* = max{li/>0:.y+/(x->')e Q*00h 
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u'* 1 = u , j + l : = s and go to 1. 

c) if v(w,y,k)> Y Itl , k = max{* /t , , Eu ( + H, 

x( 5 ) = argmin {f Q ix ') / 1 sizs) , ? #+1 = max { 1 i/>0 : 

y + t(x(s)-y)e Q k (y)}, x : = i t+l x(s) + (1 -'t^)y, « = e. and go to 2. 

It comes to the point when^ and k became consistent with 0 < y < 1, then every application 
of the relaxation operator leads to xe that u = &(y,x)d~ l (x,y,u)€ U yk . 

From this point on the "center" y e Q , and the barrier parameter k = k f is fixed, every 
i ^gr ange multipliers update shrinks the distance from x and u to x and u by a factor 0 < y < 1 and 
ueU yk ^ueU yk . 

The numerical realization of the operator R is based on smooth unconstrained minimization 
technique. 

We will describe it based on Newton Method with a step length for finding an approximation 
x for the £ = ${y,u,k) = argmin if{x,y,u,k) / x e R" }. 

The Newton direction { we find from the system 

F"(x,y,u,k)t = - F'(x t y,u,k ) (913) 

The step length t we find using Armijo rule. We check the following inequality for / = 1 . 

Fix +tC,y,u,k) - F(x,y,u,k ) s VitiF'(x,y,u,k), f ) (9. 14) 

If (9.14) is satisfied we set 

x : = x + r £ ( 915 > 

if not, we set t . = Vi t and check (9. 14) again up to the point when (9. 14) is true, then update x by 
formula (9. 15) and go to the first phase of the Newton Method - find the Newton's direction {. 
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We continue the process (9. 13), (9. 1 5) up to the point x , which satisfies (9.7). 

Note that if x is within the Newton area of x(y, u t k), then it takes only 0 (In In e' 1 ) Newton 
steps to get x . 

In view that from some point on both the "center" y € Q , and the barrier parameters k > 0 

are fixed and ueU t u eU . the primal approximation will remain in the Newton area for the 
y** y»* 

system F x ( x, y, u, k ) * 0 in x after every Lagrange multipliers update. 

Such a point we call "hot" start. From this point on at most 0 (In In e' 1 ) Newton steps 
requires for every Lagrange multipliers update. Each update shrinks the distance from the current 
primal - dual pair (or, u) to the primal - dual solution ( x* , u ) by a factor 0 < y < 1 So from this 
point on it takes 0 ( In e" 1 ) • 0 (In In e' 1 ) to get the approximation to ( x* , u ) with accuracy e > 0. 
To reach the "hot" start sometimes one has to increase significantly the barrier parameter. 
For a very large barrier parameter, the MIDF became close to IDF, so in such case we might 
lose the the potential advantages of MIDF. 

Also, as it was shown in Section 5, MCM is close to prox method with entropy-like kernel. 
It is well known that prox methods converge with linear rate and again to decrease the ratio one has 
to increase the parameter. 

On the other hand being a Classical Lagrangean.for the equivalent problem the MIDF 
possesses properties which make them fundamentally different from the Classical Lagrangean 
L(x,u) for the initial problem. 

Therefore we can expect that the dual function and the dual problem, which is based on 
MIDF, might have some interesting properties, which will enable us to improve the convergence 
without increasing the barrier parameter. 
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In the next section we are going to consider some aspect of the duality theory, which is based 


on MIDF. 


10. Dual Problems Based on MIDF. The MIDF, as we mentioned above, is a Classical Lagrangian 
for a problem, which is equivalent to (1 . 1), but not every result of the duality theory can be translated 
automatically when instead of the Classical Lagrangians for the initial problem, one considers the 
MIDF. The situation at this point is different from the situation with the Modified Barrier Functions 
(see [Pol92 ]), which are also Classical Lagrangians for a problem which is equivalent to the initial 
problem (1.1). 

On the other hand, using MIDF, it is possible to obtain some new important characteristics 
for the dual functions and dual problems, which are impossible to obtain by using the Classical 
Lagrangians for the initial problem. 

We are going to start with the basic optimality criteria for the convex programming problem. 
Theorem 3 . Let for the convex programming problem (1.1) the Slater condition (1.2) is satisfied , 


then 

1) if for y e intQ and k > 0 there exists a vector u - ( u\ uj * 0" such that 

«,*/(**)=<>, W » »d F(x,y,u*,it)i/ r (x> > «*,*),Vxel*:ACK,x)>0(10.1) 

then the vector x* is a solution of problem (1.1). 

2) if f (x ) e C 1 , / ■ 0, 1 ,..., m, and if x* is a solution of problem (1.1), then there exists u a 0 

such that (10. 1) holds true for any y € int Q and k > E u*, 

3) if (x*. u ) is a saddle point of the MIDF F(x,y,u,k ) : 

F(x,y,u •,*) i F(x \y, u ',*) » F(x \y,u,k) . Vx £ V : A (y,x)> 0 , u e «*, (10.2) 
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then (x*, «*) is the K-K-Ts pair for any y e int Q and k > 0. 

4) if (x*. y ) is a K-K-Ts pair then the pair (x*, t/*) is a saddle point of the MIDF F(x,y,u,k) 
for any>» e int Q and k > E u*. 


Proof : 

1) Let (10. 1) holds true, then 


-In A C V,x)~k' 1 E u* []n(kf t (x) + A 0\x» - In A 0\x)] 
i -lnA(>',x*)-^" , Ett/ln[^A' 1 Cy,x*) < /J(x*) + l)] 

* - In A (y,x * ) , 


i.e. 

-lnACv,x)i -InA 0\x *) E tt/»ln[^A' l Cy,x)/J(x) + l] 

<•1 

In view of A (y, x ) > 0 the last term at the right hand side is non-negative for V x € Q, therefore 
lnA(y,x) s In A (y,x *) -/ 0 (x) */ 0 (x *) , Vx e Q : ACv,x)> 0 
and x* is the solution of problem (1.1). 

2) ifx* is the solution of problem (1.1), then there exists if* i 0* such that k',/ ( x* ) = 0, r = 1 
and L' x (x, u ) * 0", therefore due to the Proposition 2 we have 
F,'(* \y.u •.*) - A-'O-.* 1 ) i.'(* > 0 * 0 

The MIDF F(x,y,u',k) - ( - 1 A (y,x) -k'Xu’ In (*/;(*)♦ A O',*)) is convex 

function in x e R" : A (y, x ) > 0 for anyy € int Q and * > E u*. Therefore F(x,y, u,k) * 

F( x*, y, u, k ) for any x e R" : A (y, x ) > 0. 
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3) if (x*. u* ) is a saddle point of the F(x,y,u,k ), i.e. 

-InACy.x)-*' 1 E « < *[ln(^(x) + A(y,x))-lnACy,x)] 

M 

i -InACv.x*)-** 1 !) t//[ln(*:y;(x*) + ACK,x*))-lnACv > x*)] 

l m { 

i -lnACv,x*)-if 1 Eti < [ln(^(x*) + A(^ > x‘))-lnACv > x*)] 
then x* e Q In fact, suppose the contrary, i.e. there exists /„ f t {x * ) < 0, therefore 
\n{kf t (x * ) + A (y,x * )) < 0 for y e int 0 and *>0. So by fixing u, = 0, i* i 0 and increasing 
u we obtain a contradiction with the right side of the inequality (10.2), so/ ( x ) * 0 i = l,...,/n. 
Now we will prove that if (10.2) holds true, then u'f (x* ) - 0, i - 1,..., m. In fact for u = 0 

Ml 

from the right side of (10.2), we obtain E u* ln(* A '* (y,x * )/(x *) + 1 ) s 0 but we just proved 

/-I 

that / (x*) i 0, i - 1 ,..., m, therefore ln(*A _, Cv,x*)./;(x*) + l) * 0,i = \,...jn and 
EM ( *ln(JtA‘ , Cv.x*)/(x*) + l) i0, so for any y € int Q and * > 0 we have 

fl 

n/ln (*A ~ l (y,x %(x *) + 1) = 0 , / = 1 Hence u,/(x*) = 0, i - 1,..., m and from the left side 
of the inequality (10.2) we obtain 

-In A (y,x) i - In A O',* * ) ' l E ir/ ln(fc A " l (y,x)/ t (x ) + 1). 

In view that for any x e Q : A(y. x)>0we have ln(*A' l O\x)/(x) + 1 ) i 0 we obtain that 
-lnACv.x) i -lnA(>,x*) or 

In (/„ O') -/„(*)) < hOiW-/,(J'))-/,(*)>/,(»'). Vxe Q : A (y,x)>0 
i.e. x* is the solution of problem (1.1). 

4) First of all, note that if ( x* u ) is the K-K-Ts pair then x* is the solution of problem (1.1). Using 
the same consideration as in 2) we obtain F* x ( x*. y, u , k) = 0 and F (x , y, u , k) = \n &(y, x ) 
for anyy € int Q . Taking into account the convexity F(x, y, u, k ) in x for any k>T.u, we obtain 
F(x,y,u\k) i F ( x*. y, u,k). Vie Q*(.y), and due to the definition of the function 
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F ( x, y, u, k ) we obtain 

F(x,y,u,k)i F ( x", y, u, k ), V x e R" : b(y, x ) > 0 
Also k _l E u t ln(£A _1 ( y,x * )/ t (x *) + 1 ) i 0 , / = 1 Therefore 

F(x\y,u\k) = - ln(/J(y)-/ 0 (**» 

■ F ( x* , y, u, k) , V u e R\ . 

So ( «* ) is a saddle point of the MIDF F(x,y,u,k) for any y e int Q and k >£ i/,* 

We have completed the proof of Theorem 3. 

Remark 4 . We want to emphasize again the difference between MIDF F '(x, y, u, k) and MBF 
F(x, u, k). The statements 2) and 4) are not true (see Section 7, example 1) without the condition 
k >E I#/ while for the MBF (see Theorem 4 [Pol 92]) the statements 1) - 4) are true for any k > 0. 

Now we are going to consider the dual pair constrained optimization problems, which are 
based on MIDF. Let y e int Q and k >E u, be fixed and ♦, *(*) = sup F(x,y, u,k). Then 


b 


IntfoOO -/ 0 00). tf/oOO -/ 0 (*)>0 i = 


otherwise 


and the initial problem (1.1) reduces to finding 

x* = argminf lJr^Cx) |xe R* ) (10 3) 

We define the dual function by the formula <p^ t (n) = inf F(x,y,u t k). Then the dual to (1.1) 

uV 
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problem consists of finding 

u *= argmaxf <p y4 (w) | u±0) (10 4) 

Due to the definition of * (x ) and <p,. * (u ) for any y e int Q and * > 0 we have 
- In C/ O 00 -/ O 00) = +,.*(*) * <P y .*(«) , Vx e Q : A O',*) > 0 , « e R*. 

Therefore if x and u are feasible solutions of the primal and dual problems respectively and 

* y>i (x) = * y ,*(*0 then x = x * and u = u * 

The smoothness of the dual function q> y> *( u ) depends on the smoothness of / ( x ), i = 0, 
1 ,...m and convexity properties of the MIDF F ( x, y, u, k ) in x € R". 

The next theorem describes the smoothness properties of the dual function <p yi (u ). 

Theorem 4 . If (1 . 1) is a convex programming problem,/ (x ) e C* , i~ 0, 1,..., m and conditions 
(1.3) - (1.5) are satisfied, then there exists *<, such that for any fixed ^ € int Q and any fixed k * 
k 0 b(y,x') + E u * : 

1) the concave function q> y £ u ) is twice continuously differentiable in U y k . 

2) the gradient of the dual function is defined by formula 

* ^(Sty.u,k),y,u,k) - F„'(}(*).0 

= -t- , (ln(*/(S(.))A-‘0-.S(«))*l),-.ln(t/.(S(*)A l O'.S(»))*l)) (105) 

3) the Hessian of <p y *( u ) is defined by formula 

<p£,* ) m («) - ( 106 > 

where F"($(*),*) - F"£(*(*>.*) 

Proof ; First of all, note that <p y _ *( u ): R," - R is a concave function whether or not the functions 
f t ( x ) , i ■ 1,..., m are convex. If the standard second order optimality conditions (1.3) - (1.5) are 
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satisfied then due to Theorem 2 the function F(x,y,u,k) is strongly convex in the neighborhood 
of x = x(y,u,k) for V (y,u,k) e !>(•) 

Therefore $(y,u,k) = £(•) is a unique minimum of F(x,y, u, k) in x while <p ,.*( u ) = 
F(x(y,u,k),y,u,k) is smooth in ueU y k i.e., there exists the gradient <p'(y. *)«( u ) = 

(•)»•)■ Taking into account F'(x(*),*)=Q we obtain 


<P (**>.(“) =F « (*(•),•) - 


a< *> (,.*)(“> 

dtt, 


du_ 


= -Jfc _l (ln(Jb/| ($(•)) a + 1) to(*/.a^(*»A- l Cy^(-)) + O) 


Further $(y,u \k) =x * and *,*)) =f t (x *) = 0, / = W , therefore 

<p^ > (tt*)*-Jt"H , n(ilr/ l (Jf*)A' , CV,x*) + l) *)A" 1 CV» Jt *) + 0. 

In (*/ r ♦ 1 (* *) A ~ l O',* *) + 1 ) «• ••» In (*/* (* *) A ■* O'.* * ) + 0 

= (0, ,0, -h' x \n(kb~ l {y,x*)f r . x (x *) + l -t^ln^A^Cv.x *]/«(■* *) + 0) 

Since the matrix /^(x(*), •) is a positive definite for any fixed y € int Q and k z k 0 A( y,x ) + 
Y u* the system F , I ( x, y, u, k ) — 0 yields a unique vector function x (y, u, k) such that 
F^(x(y,u,k),y t u,k) ■ F'(x(»\») ■ 0\ V« e U yk and x(y,u \k) = x * 
Differentiating the last identity by u we obtain 

F£tf(y,u,k),y,u,k)# u (y t u t k) + F" u {${y,u,k),y t u,k) = O'" 

therefore 

($(•).'), Vue C/,,. 

Hence 
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(10.7) 




F"(l(-),‘) ‘ F"' ($('),•) - 

To compute <p" 0 ,. *>«(«* ) we first consider 

F"(S(y.u \k),y,u \k)-F"(.x \y.u\k ) =■ 

-<<-'(* \y.k)f'(x ')-A~ l (y,x')D(x')d~ l (x' ,y,k)F m (x * ) > 


A'Cv,* - )/', 0 


r,M -r 


\w-r,r 


d.'rix' ,y.k)) 


/(*•)- A-' Cv,*') 



\ 

0 r.«t-r 

[diag^(x*)]Z. r+1; 




( 10 . 8 ) 


then 


<p" t) „(u •) - -F"(* -,y.u\k)F"(x \y,u ",k))-'F*(x \y,u \k) • -F"(F"r l F" 


Remark S . The dual function <p(u) = inf L(x,u), which is based on the Classical Lagrangian 

x« 

L(x,u) for the initial problem is in general nonsmooth under the conditions of Theorem 4. It becomes 
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smooth, if for example,/, (x ) is strongly convex, while the dual function <p (y *, ( u), which ts based 
on the MTDF F ( x, y, u, k ) is smooth even for the nonconvex programming problem, when along 
with (1.3) - (1.5) the growth conditions (see [Pol 92] p. 181) are satisfied and k i k 0 h(y,x ) + E u, . 
The next theorem establishes the main result of the duality theory which is based on the MIDF. 

Theorem 5 . Let / 0 (x ) and all -/ (x ) be convex, then 

1) if/ ( x ) € C l , / - 0,.. ., m and the Slater condition holds, then the existence of the solution x* of 
the primal problem (1.1) implies the existence of the solution u of the dual problem (10.4) and 
Vy k (x* ) = V* * ( u * ) f° r y € int Q and * > E u, 

2) if/ ( x ) € C 2 , / = 0 ,..., m and the optimality conditions (1.3) - (1.5) are taking place, then the 
existence of the solution u e U y , k for the dual problem implies the existence of the solution x for 

the primal problem and 4r r * (x* ) = <P* * ( »’ ) for any^e int Q and* * k 0 A(>\x ) + E . 

3 ) if j ( % ) e C* , i - 0,..., m and the second order optimality conditions (1.4) -(1.5) are satisfied for 
the primal problem, then the corresponding conditions are taking place for the dual problem (10 4) 
for any y e int O and k a k 0 A (y,x ) + E u, . 

Proof: 

1) Let x* be a solution of problem (1.1), then there exists (see Theorem 3) a vector u such that 
(10. 1) holds true for any y e int Q and * > E «,*. Hence for any u a 0 we obtain 

<p k (u * ) = min F(x,y,u * ,k) = F(x \y,u \k) = 

* *«r 

i|r t (x * ) a Fix * ,y,u,k) a min F(x,y,u,k ) = <P y .*(«) , V u e R* 

y * x < IT 

so u is a solution of the dual problem and *( x ) “ Wy, 
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( 10 . 9 ) 


/ 0 (* * ) =/oCv) - «p( - VyA U *» 

for any y € int Q and k>T, u*. 

2) Conditions (1.3) - (1.5) imply that the function F(x, y, u, k) is strongly convex (see Theorem 2) 
in the neighborhood ofx(y,u,k) - argmin {F(x,y,u t k) \ x € E"), therefore the vector x(y,u,k) 
is unique for any ( y, u, k ) € D ( • ) and because of the smoothness of f (x ). / = 0,..., m the 
gradient <?'(,,*)„(«) exists. 

Let u € U .be a solution of the dual problem (10.4) and x = x(y,u,k ) . Then the optimality 
conditions for the dual problem (10.4) are fulfilled at u, i.e. 

-*' l On(^) + ACv,x)-lnACv,i)s°, for / : ^ = 0 
q >' y k)H (Z) * - k ■* (\n(kf t (x ) ♦ A 0>,x) - In A O.i ) = 0 , for / : u t > 0 
Then S>0 implies \n(k/ t (x)^(y,x)-]n£i(y,x)^0 -f t &)*0 whUe ^>0 implies 
hi(kf t (x) + A Cv.jc)) = In A (y,x ) -/(x) = 0 , i.e. x e Q and forthe pair ( x , u ) the complemen- 
tarity conditions f t (x)u ( - 0 , i - hold. Therefore 

i.e. for the primal and dual feasible pair (x, u ) we have 

<p y .*(«) = ♦**(*) * -In A (y t x) 

hence x = x * , u * u * 

3) We will now show that the second order optimality conditions hold for the dual problem (10.4) 
in the strict form . 

First, we note that the gradients e t = ( 0,... , 0, 0,..., 1 ..... 0) , i = r + of the active 
constraints u, z 0, i = r + 1,..., m of the dual problem are linearly independent. Later we will prove 
that the Lagrange Multipliers, which correspond to the active constraints of the dual problem are 
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positive. Along with the linear independence of the e, , i = r + 1,..., m , this forms condition (1.4) for 
dual problem (10.4). Now we will prove that the condition type (1.5) is also satisfied for the dual 
problem. 

ft 

Let us consider the Classical Lagrangian L^ k ^{u,k) * <P( y> t)( u ) + Ej k ( u l for the dual 
problem (10.4). The Hessian of the Classical Lagrangian for the dual problem is £"(**)•« (u, k) = 
<p” (y ^^(u) and the second order optimality condition for the dual problem (10.4) is 

(L^ k)ltu (u *,X*)v,v)s -p(v,v), Vve R" : (v.e,)=0,/=r + , p> 0 (10.10) 

To prove (10.10) we first consider the matrix F n xx = F"„ ( x*. y, u, k ). Taking into account 
conditions (1.4) - (1.5) for the primal problem, we obtain from Theorem 1 that 

mineigval F„ = p A' 1 (y, x*) > 0 
for any,y e int Q and k i k 0 h(y,x*) + E u*. Let 

maxeigval ( L"(x \« * ) ‘)U r ‘^(x-)) - M 

Then maxeigval F”„ =A/A'* (y, x\ so for any w e R" we obtain 

p' 1 A (y,x * )(w, w) i (F"2 w, w) * AT" 1 A (y,x * )(w, w) 
-p’ 1 ACv,x‘)(w,w)i(-F // ^ 1 w,w) s -AT'ACK.x’Xw.w) 

Therefore due to (1 0.6) 

(A**>«(* '. r ) v > v ) - (*£»>-(» ') T - v > = 

(F"(-F"y'F“ v . v ) >(( -F"y'F"x.F a v) 

Let v = ( v„..., v. X then ( v,e y ) * 0 - v, - 0, therefore any vector v e R" : ( v, e , ) - 0, / - r + 1,.,/n 
has the form v = ( v„..., v r , 0,..., 0 ) = ( v (r> 0,..., 0). Therefore, due to (10.8), d l (x*. y, k ) v = v and 
D(x*)v = 0" we have 

F"(x‘ t y,u\i)v--/"(x‘)J- , (x‘.y.*)v-*- , ly.x')Fj(x , )<l- , (x‘.y.t)D(x’)v 
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* - A '* (y t x ‘ )f ,r (x ‘ )v 


So 


(I "(« *, V,y,k)v, v) = ((F"y l F"\ t F"v) 
i -A _1 Cy,x* )AT l (F"v,F"v) = - a - l c v,* ‘ )M-‘ (/£,(* * )v (r)> / / ; ) (ot * ) y (r) ) 

- - A -* (y,x * )AT 1 (/£>(* * )/£>(* * )v (r) ,v (r) ) 

It follows from (1 .4) that the Gram matrix (/f r) (x * )f'[ r) (x * ) is nonsingular. Therefore, 

mineigval if {r) (x * )/£ } (* * ) = Ho > 0 
and 

(/£>(* • V[,P • ) v w . v w ) i H„(v w , v (r) ) - Mo(v. v) 

therefore, for p * A _1 {y,x * ) A/' 1 p,, > Owe obtain 

(L "' k)tm (u \ X* )v, v) s - J(v, v), V v : ( v, e, ) = 0 , / =r+l,...,m 
i.e. the condition type (1.5) for the dual problem (10.4) is satisfied. 

Also in view of/ (x* ) > 0, / = r + 1,..., m we have 
X; = - <p * ) * ib 1 ln(Jb A Cv,x * ) */(x * ) + 1 ) > 0,/ = r + 

which along with the linear independence of the gradients e, , / = r+ 1,..., m of the active constraints 
a, * 0, / = r + 1,..., m, and (10.10) comprise the second order optimality conditions for the dual 
problem. We completed the proof of Theorem 5. 

Corrollarv . The restriction % tk) (u) = I u r .i 3 ~ = u - = 0 ofthe dual fUnction 


67 



<p ^ 4) ( u ) to the manifold of the active constraints u, - 0, i = r + 1,..., m of the dual problem (10.4) 
is strongly concave if the conditions of Theorem 5 are fulfilled. 

Remark 6 . Statements 2) and 3) of Theorem 5 are in general not true even for a convex program- 


ming problem if the dual function <p(«) = inf L(x,u) is based on the Classical Lagrangian 

XC If 

L ( x, u ) for the initial problem. However, these results are valid for any k * k 0 Afj'.x* ) + £ u* and 
any y e intQ even for a nonconvex programming problem if the conditions ( 1 .3) - ( 1 . 5) and growing 
conditions (see [ Pol 92 ]p. 181 ) are fulfilled and the dual functions are based ontheMIDF 
F(x,y, u,k). 

Remark 7 . All statements of Theorem 5 hold for the Modified Interior Distance Function H (x, y,u,k) 
and the corresponding dual function 

h k (u) - mm{H(x t y t u,k) |xeR"> 
with the only difference that instead of (10.9) we have 

/»(*•) -/.O') -*,><•) 

for any y e int Q and k > Y, u*. 

Let y e int Q and * * k 0 A(^,x*) + E m/ also let (y. u, k ) € D (• ). We consider the dual 


problem (10.4) in the following form 

u * = argmax { F(x,y,u,k) \ F'(x t y,u,k ) = 0, u i 0 } (10.1 1) 

(P. Wolfe's duality [Wol 61]), then the MCM method 

x'* 1 :F'(x'* l ,y,u \k) * ) “ 0 (10 12) 

u" 1 - <1013) 

is an Interior Point Method for the dual problem (10. 1 1) and the estimation 

max( |x' +1 -x*|,l«'*‘ -u‘\ick l ACk,x*) |ii' (10.14) 
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holds true under the conditions of Theorem 2 for any {y,u°, k)eD(»). 

Finally note that the properties of the dual problem (10.4) (see Theorem 5) allows to improve signifi- 
cantly estimation (10. 14) by using smooth optimization methods for solving the dual problem (10.4) 
in a way which is similar to (see [Pol 92], pp. 206-208). 

11 Concluding Remarks . The Lagrange multipliers, the specific role of the barrier parameter 
along with the extension of the feasible set give rise to properties PI - P5, which makes MIDF 
F(x,y, u, k ) substantially different from DDF F(x , a ) - F(x,f 0 (y)). 

One can view both MIDF F ( x, y, u, k ) and IDF F ( x , a ) as smooth approximations for 
a nonsmooth and convex in x e Q : A (y, x ) > 0 function : 

A.(x,a) = max! -ln( a -f 0 (x)), -ln(a -f 0 (x) +f t (x)) i ■ K 
Assuming again In t = - « for t s 0 it is easy to see that solving the problem (1.1) is equivalent to 
solv ing the following unconstrained optimization problem 

X(x*,a) = mini \(x,a)/xe R") = -ln(a -f 0 (x‘)) 

Therefore the unconstrained minimizers 

$(y,u,k) = argmin { F(x,y,u,k)/x e R" } 
or 

x(tt) * argmin (F(x,«)/Jte R" } 

one can view as an approximations for the solution x* . The "quality" of approximations 
x{y t u y k) and £(a) to x* depends on the "quality" of the smooth approximations F(x,y,u,k) 
and F(x,a) for the function A ( x, a ). It is clear that these approximations are very different. 
In case of MIDF F ( x, y. u, k ) we have lim x(y,u,k) = x * for any fixed y € int Q and 

u-u * 


69 



k>T.u, . 


Moreover 

Urn [F&(y.u,k),y,u,k)-\(x\a)]=0 
* 

In case of F ( x , a) we have lim x (a) * x* only when a =/ 0 ) - a* =/ 0 ( x* ). However 

for any fixed "center" y € int Q we have 

lim F(x, a ) = » 

x-x* 

So, while the MIDF F(x,y, u, k ) is an exact smooth approximation for X (x, a ) at the 
solution x* for any fixed "center " y e intQ and any fixed k>L u*, IDF F (x, a) does not exist at x* 
and 

lim [F(x,a) - X(x*,a)] = +«*. 

X~*X * 

for any fixed "center" y € int D . 

The "quality" of the approximation F ( x, y, u, k ) has a substantial impact on the "quality" 

of the MCM (5.6) - (5.7). Method (5.6) - (5.7) converges only due to the Lagrange multipliers 

update. Therefore from some point on the function F(x, y, u, k) as well as its gradient and Hessian 

in x is not changing much after each Lagrange multipliers update, because lim u ' = u * . Being 

!**• 

well conditioned at the primal - dual solution ( x*. u ) the Hessian F” a ( x,y,u,k ) remains 
well conditioned in the neighborhood of ( x* u * ), so from some point on if x* ‘ 1 is "well" defined in 
terms of Newton method (see[Sm86]) for the system F t (x, y, «*,*)= 0" the new approximation x 1 
will be "well" defined for the system F^x.y, i/'\k) = <T and it will remain true up to the end of 
the process. Such x* we will call "hot" start ( see[MeIP95]) .Therefore from the "hot" start on it takes 
only 0 ( In In e' 1 ) Newton steps to find a new primal approximation and update the Lagrange 
multipliers. 
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Due to the estimation (6.4) every Lagrange multipliers update shrinks the distance from x* and 
vt to x and u by a factor 0 < y < 1 Therefore it takes only 0 ( In e*‘ ) • 0 ( In In e' 1 ) Newton's 
step from the "hot" start to the solution. 

To reach the "hot" start one can use any of path-following methods (see [NesN94]). 

The moment when the above mentioned acceleration begins depends on the triple (>\ u, k ) 
First of all the triple (y.u.k) has to be in D (• ) and let 's assume that y ,. * = c k l A(y, x') = y = 0.5, 
i. e. 

k 2 max { 2c A (y,x *), k Q A(y,x*) + E u * ) (11 1) 

The constam c > 0 depends on the norm | R(x *,y,0,k) | and o > 0 , the constant k^ depends 
( see Assertion 1) on A, > 0, min { u * | / = 1,..., r }, max { u, \ i - 1,..., r } and mineigval 

* )f[ r) (x * ). So the value of both c > 0 and * 0 > 0 depends on the "measure" of the nondege- 
neracy of the constrained optimization problem. 

Recall that in unconstrained optimization the properties of the smooth optimization methods 
depend very much on the condition of the Hessian at the solution. One can consider this condition 
as the "measure" of nondegeneracy of an unconstrained optimization problem. 

The analysis which was undertaken above, highlights the parameters which are responsible 
for the nondegeneracy of the constrained optimization problem and the way in which it influences the 
complexity of the NMCM. To complete the discussion we will make one more comment. 

Due to (1 1 . 1) the "hot" start depends very much not only on the "measure" of nondegeneracy 
of the constrained optimization problem, but also on the value of £ u t . The sum E u, is critical for 
the theory of MIDF ( see Theorem 1-5). On the other hand, E depends on the condition number 
of the feasible set, i.e. onit=r 0 R ^' 1 , where r 0 is the largest radius of the sphere inscribed in O and 
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Rq is the smallest radius of the sphere circumscribed around Q. 

Recall the the value E u' is crucial in the theory of exact penalty methods (see [Ber82 ]) 
However, in constrast to the exact penalty function, the MIDF is smooth and for some convex 
programming problems possesses the self concordant (see [NesN94] properties when u = e m . 

Therefore, the NMCM combines the self-concordant properties of the Shifted Interior 
Distance Function which guarantees the polynomial complexity of the Interior Points Methods, with 
important local PI - P5 properties. It allows us to speed up the process in the final stage, to make 
the process numerically more stable, and has the potential for improving the complexity bounds at 

least for nondegenerate convex problems (see [MelP95]). 

Due to (11.1) the moment when the process switches from the first phase to the second 
depends on the "measure" of nondegeneracy of the constrained optimization problem as well as on 
the condition number of the feasible set. 

There are still a number of questions, which have to be answered. 

First, we have to understand the behavior of the MCM in the absence of nondegeneracy 
assumptions. 

Second, the convergence under the fixed y € int D and k > is in no way a suggestion that 
both the "center" and the barrier parameter have to be fixed. To find the optimal strategy for changing 
the "center", the barrier parameter or both is another important issue. 

Third, to find conditions, which provide the convergence of the prox-type method (5. 12). 
Fourth, to specify the MIDF and MCM for LP and QP problems. 

Finally, the complexity of the Newton MCM is one of the important and still unanswered 

questions. 
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A-l 


Appendix 

A1 Proof of the Proposition 4 . We consider the Hessian F n ae (x\ y, u, k ) for a fixed y € int Q and 
k> 0. 


F"Jix,y, u*. *)-[(!- k- 'IVXAO) ~M*)T 7'oW - k~ V/W +A0) 

-a wr 'wm -/v*»] ' lxmJ . 

- [(1 - 'LuWofy) + (1 - k-'lu-my) -/o«r 7"oW + 

*- 'Zu-WM + A 0) -AM) - W'W -/o W'fflfW -A ’<*)) - *" '^"/.kj-fx) + 

AO) -/o(x)r '(W*) -/'Vx))],,. ^ 

- [( l - *- 'Zu'my) -A (x*))' V'o(x*)>7Vx*) + (l - 'SkWoM -/o(x*)r '/”o(x*) 

+ *- 'lu-,(A0> -/o(x*)r W'Xx*) -A M) r (W -/'o(x*))- 

*■ 'SuVAO) -A(x*))' V/’Vx*) - A "(X*))] 

- (AO) -A(x*r '[(AO) -A(x*))‘ Vo '(x*))7o '(x*) - ** '(tu’my) -A(x*)r 1 • 

(A '(x*))7o '(x*) + A "(x*) - *■ '(XnMA "(X*) + (AO) -A(x*)r ‘(*ix*A! •(*•)//, ’(x*) 

- (A '(X*)) r (la*/A ’(x*)) - (iM*,/Vx*)) r A '(x*)) + *■ '(£»*iXA '(x*)) r A '(x*)) 

- zu'j, "(x*)+ *- '(x»*aA "(x*n 

Taking into account the K-K-Ts relation f a (x* ) ■ Z u' f, (x ) we obtain 


F"(.x\y,u\k)-(f c (y) -/«(*• ))-'[£"(*•.»•) 

+ kf/oCv) -/o (x • ))■' (/£,(* ' )fu; fa * ' ) - (4CV) -/ 0 (x-))‘ , (^(x-))7(x')] 


-CAW -/,(* , )>-'[i£(* *.«•)♦ 


(a tv) -/ l (*')rw;('')tf,‘4( 1 ') -/;>(*•) 



* 
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A-2 


Using the same considerations we obtain 


.-2, 


u* k) - (40) -/ 0 (jt*)) [t"_ (**, u*» 

+ 20S0-) -/^))' W'<„ W «•/„(<•) - (r w (0) r <u* ( /' M (x*))]. 


A2. Formula for the MIDF Hessian F~(x,y,u,k) 


F"^x,y,u,k) lxmi -(l -** l Ia,VoM-/o(i)r 1 /'oW'o(i) + (l ’in,) 


• </oO>) -Ml))' ' /’Vi) - k~ '■ZuLkftx) +/ 0 0 ) ‘(*/"Xx) -/"o(i)) + 

*- '0Hk/(i) +/ 0 (j>) -/ 0 (i))- \kf(x) -/Vi)) r (*/’r(i) -/Vi)) 

- (A(v) -/ 0 (i)r ’cOoO) -/«(i)r V'oWvi) - *" W'. r (frA) 

+/"o(i) - ** ' WVD - Znrf/oM -/o(i)X*//i) +/oM -/o(i)r '/"i(i)) 

+ *- 'XnVoM -/o(i)X*jXi) +/o 0 ') -/o(i)r '/'Vi) + *“ WoM -/ 0 (i)) 


(*/Xi) +/«M -/o(i)r '(*//i) +/oM -/o(i)r ‘(*‘/ 7 (irXi) 


*fL*r*T f f 

- */'J(irxi) - */', r (iro(i) +/'o r (iyvi)n 

- WoW -/o(i)r'[(/oO') -/o(i)r V’ftiyvi) - 'wSinJi + 
/•Vi) - *' W'Vi) - S»/"Xi) + *■ '(Eir Vi) + 


*' 'av*/(i) +/oO’) — '(‘r/wxi) - */ wxi) - */•/< fvvi) +/•» wviM 


£\\~ t e* T i£\ff r£\ _ t j. r< T i£\f it\ 
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